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Preface to Analysis of Functions of a 
Single Variable: A Detailed 
Development 1 



For Christy My Light 

I have written this book primarily for serious and talented mathematics scholars, seniors or first-year 
graduate students, who by the time they finish their schooling should have had the opportunity to study in 
some detail the great discoveries of our subject. What did we know and how and when did we know it? I 
hope this book is useful toward that goal, especially when it comes to the great achievements of that part 
of mathematics known as analysis. I have tried to write a complete and thorough account of the elementary 
theories of functions of a single real variable and functions of a single complex variable. Separating these 
two subjects does not at all jive with their development historically, and to me it seems unnecessary and 
potentially confusing to do so. On the other hand, functions of several variables seems to me to be a very 
different kettle of fish, so I have decided to limit this book by concentrating on one variable at a time. 

Everyone is taught (told) in school that the area of a circle is given by the formula A = trr 2 . We are also 
told that the product of two negatives is a positive, that you cant trisect an angle, and that the square root of 
2 is irrational. Students of natural sciences learn that e l7T = — 1 and that sin 2 + cos 2 = 1. More sophisticated 
students are taught the Fundamental Theorem of calculus and the Fundamental Theorem of Algebra. Some 
are also told that it is impossible to solve a general fifth degree polynomial equation by radicals. On the 
other hand, very few people indeed have the opportunity to find out precisely why these things are really 
true, and at the same time to realize just how intellectually deep and profound these "facts" are. Indeed, we 
mathematicians believe that these facts are among the most marvelous accomplishments of the human mind. 
Engineers and scientists can and do commit such mathematical facts to memory, and quite often combine 
them to useful purposes. However, it is left to us mathematicians to share the basic knowledge of why and 
how, and happily to us this is more a privilege than a chore. A large part of what makes the verification 
of such simple sounding and elementary truths so difficult is that we of necessity must spend quite a lot 
of energy determining what the relevant words themselves really mean. That is, to be quite careful about 
studying mathematics, we need to ask very basic questions: What is a circle? What are numbers? What 
is the definition of the area of a set in the Euclidean plane? What is the precise definition of numbers like 
ir,i, and e? We surely cannot prove that e l7r = — 1 without a clear definition of these particular numbers. 
The mathematical analysis story is a long one, beginning with the early civilizations, and in some sense only 
coming to a satisfactory completion in the late nineteenth century. It is a story of ideas, well worth learning. 

There are many many fantastic mathematical truths (facts), and it seems to me that some of them are 
so beautiful and fundamental to human intellectual development, that a student who wants to be called a 
mathematician, ought to know how to explain them, or at the very least should have known how to explain 
them at some point. Each professor might make up a slightly different list of such truths. Here is mine: 

1. The square root of 2 is a real number but is not a rational number. 
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2. The formula for the area of a circle of radius r is A = irr . 

3. The formula for the circumference of a circle of radius r is C = 2irr. 

4. e i7r = -1. 

5. The Fundamental Theorem of Calculus, f* f (t) dt = F (b) - F (a) . 

6. The Fundamental Theorem of Algebra, every nonconstant polynomial has at least one root in the 
complex numbers. 

7. It is impossible to trisect an arbitrary angle using only a compass and straight edge. 

Other mathematical marvels, such as the fact that there are more real numbers than there are rationals, 
the set of all sets is not a set, an arbitrary fifth degree polynomial equation can not be solved in terms of 
radicals, a simple closed curve divides the plain into exactly two components, there are an infinite number 
of primes, etc., are clearly wonderful results, but the seven in the list above are really of a more primary 
nature to me, an analyst, for they stem from the work of ancient mathematicians and except for number 7, 
which continues to this day to evoke so-called disproofs, have been accepted as true by most people even in 
the absence of precise "arguments" for hundreds if not thousands of years. Perhaps one should ruminate on 
why it took so long for us to formulate precise definitions of things like numbers and areas? 

Only with the advent of calculus in the seventeenth century, together with the contributions of people 
like Euler, Cauchy, and Weierstrass during the next two hundred years, were the first six items above really 
proved, and only with the contributions of Galois in the early nineteenth century was the last one truly 
understood. 

This text, while including a traditional treatment of introductory analysis, specifically addresses, as kinds 
of milestones, the first six of these truths and gives careful derivations of them. The seventh, which looks 
like an assertion from geometry, turns out to be an algebraic result that is not appropriate for this course in 
analysis, but in my opinion it should definitely be presented in an undergraduate algebra course. As for the 
first six, I insist here on developing precise mathematical definitions of all the relevant notions, and moving 
step by step through their derivations. Specifically, what are the definitions of \/2,A, ir,r, r 2 , C, 2, e,i, , and 
— 1? My feeling is that mathematicians should understand exactly where these concepts come from in precise 
mathematical terms, why it took so long to discover these definitions, and why the various relations among 
them hold. 

The numbers —1,2, and i can be disposed of fairly quickly by a discussion of what exactly is meant by 
the real and complex number systems. Of course, this is in fact no trivial matter, having had to wait until 
the end of the nineteenth century for a clear explanation, and in fact I leave the actual proof of the existence 
of the real numbers to an appendix. However, a complete mathematics education ought to include a study 
of this proof, and if one finds the time in this analysis course, it really should be included here. Having a 
definition of the real numbers to work with, i.e., having introduced the notion of least upper bound, one 
can relatively easily prove that there is a real number whose square is 2, and that this number can not be 
a rational number, thereby disposing of the first of our goals. All this is done in Section 1.1. Maintaining 
the attitude that we should not distinguish between functions of a real variable and functions of a complex 
variable, at least at the beginning of the development, Section 1.1 concludes with a careful introduction of 
the basic properties of the field of complex numbers. 

unlike the elementary numbers —1,2, and i, the definitions of the real numbers e and n are quite a 
different story. In fact, one cannot make sense of either e or i until a substantial amount of analysis has 
been developed, for they both are necessarily defined somehow in terms of a limit process. I have chosen 
to define e here as the limit of the rather intriguing sequence {(l + -) }, in some ways the first nontrivial 
example of a convergent sequence, and this is presented in Section 2.1. Its relation to logarithms and 
exponentials, whatever they are, has to be postponed to Section 4.1. Section 2.1 also contains a section on 
the elementary topological properties (compactness, limit points, etc.) of the real and complex numbers as 
well as a thorough development of infinite series. 

To define ir as the ratio of the circumference of a circle to its diameter is attractive, indeed was quite 
acceptable to Euclid, but is dangerously imprecise unless we have at the outset a clear definition of what is 
meant by the length of a curve, e.g., the circumference of a circle. That notion is by no means trivial, and 
in fact it only can be carefully treated in a development of analysis well after other concepts. Rather, I have 



chosen to define 7r here as the smallest positive zero of the sine function. Of course, I have to define the 
sine function first, and this is itself quite deep. I do it using power series functions, choosing to avoid the 
common definition of the trigonometric functions in terms of " wrapping" the real line around a circle, for 
that notion again requires a precise definition of arc length before it would make sense. I get to arc length 
eventually, but not until Section 6.1. 

In Section 3.1 I introduce power series functions as generalizations of polynomials, specifically the three 
power series functions that turn out to be the exponential, sine, and cosine functions. From these definitions 
it follows directly that expiz = cosz + isinz for every complex number z. Here is a place where allowing the 
variable to be complex is critical, and it has cost us nothing. However, even after establishing that there is 
in fact a smallest positive zero of the sine function (which we decide to call n, since we know how we want 
things to work out), one cannot at this point deduce that cosir = — 1, so that the equality e ln = — 1 also has 
to wait for its derivation until Section 4.1. In fact, more serious, we have no knowledge at all at this point 
of the function e z for a complex exponent z. What does it mean to raise a real number, or even an integer, 
to a complex exponent? The very definition of such a function has to wait. 

Section 3.1 also contains all the standard theorems about continuous functions, culminating with a 
lengthy section on uniform convergence, and finally Abel's fantastic theorem on the continuity of a power 
series function on the boundary of its disk of convergence. 

The fourth chapter begins with all the usual theorems from calculus, Mean Value Theorem, Chain Rule, 
First Derivative Test, and so on. Power series functions are shown to be different iable, from which the 
law of exponents emerges for the power series function exp. Immediately then, all of the trigonometric 
and exponential identities are also derived. We observe that e r = exp (r) for every rational number r, and 
we at last can define consistently e z to be the value of the power series function exp (z) for any complex 
number z. From that, we establish the equation e" = — 1. Careful proofs of Taylor's Remainder Theorem and 
L'Hopital's Rule are given, as well as an initial approach to the general Binomial Theorem for non-integer 
exponents. 

It is in Section 4.1 that the first glimpse of a difference between functions of a real variable and functions 
of a complex variable emerges. For example, one of the results in this chapter is that every differentiable, 
real- valued function of a complex variable must be a constant function, something that is certainly not true 
for functions of a real variable. At the end of this chapter, I briefly slip into the realm of real- valued functions 
of two real variables. I introduce the definition of differentiability of such a function of two real variables, 
and then derive the initial relationships among the partial derivatives of such a function and the derivative 
of that function thought of as a function of a complex variable. This is obviously done in preparation for 
Chapter VII where holomorphic functions are central. 

Perhaps most well-understood by math majors is that computing the area under a curve requires Newton's 
calculus, i.e., integration theory. What is often overlooked by students is that the very definition of the 
concept of area is intimately tied up with this integration theory. My treatment here of integration differs 
from most others in that the class of functions defined as integrable are those that are uniform limits of 
step functions. This is a smaller collection of functions than those that are Riemann-integrable, but they 
suffice for my purposes, and this approach serves to emphasize the importance of uniform convergence. In 
particular, I include careful proofs of the Fundamental Theorem of Calculus, the integration by substitution 
theorem, the integral form of Taylor's Remainder Theorem, and the complete proof of the general Binomial 
Theorem. 

Not wishing to delve into the set-theoretic complications of measure theory, I have chosen only to define 
the area for certain "geometric" subsets of the plane. These are those subsets bounded above and below 
by graphs of continuous functions. Of course these suffice for most purposes, and in particular circles are 
examples of such geometric sets, so that the formula A = irr 2 can be established for the area of a circle 
of radius r. Section 5.1 concludes with a development of integration over geometric subsets of the plane. 
Once again, anticipating later needs, we have again strayed into some investigations of functions of two real 
variables. 

Having developed the notions of arc length in the early part of Section 6.1, including the derivation of 
the formula for the circumference of a circle, I introduce the idea of a contour integral, i.e., integrating a 



function around a curve in the complex plane. The Fundamental Theorem of Calculus has generalizations 
to higher dimensions, and it becomes Green's Theorem in 2 dimensions. I give a careful proof in Section 6.1, 
just over geometric sets, of this rather complicated theorem. 

Perhaps the main application of Green's Theorem is the Cauchy Integral Theorem, a result about 
complex- valued functions of a complex variable, that is often called the Fundamental Theorem of Analysis. I 
prove this theorem in Section 7.1. From this Cauchy theorem one can deduce the usual marvelous theorems 
of a first course in complex variables, e.g., the Identity Theorem, Liouville's Theorem, the Maximum Mod- 
ulus Principle, the Open Mapping Theorem, the Residue Theorem, and last but not least our mathematical 
truth number 6, the Fundamental Theorem of Algebra. That so much mathematical analysis is used to prove 
the fundamental theorem of algebra does make me smile. I will leave it to my algebraist colleagues to point 
out how some of the fundamental results in analysis require substantial algebraic arguments. 

The overriding philosophical point of this book is that many analytic assertions in mathematics are 
intellectually very deep; they require years of study for most people to understand; they demonstrate how 
intricate mathematical thought is and how far it has come over the years. Graduates in mathematics should 
be proud of the degree they have earned, and they should be proud of the depth of their understanding and 
the extremes to which they have pushed their own intellect. I love teaching these students, that is to say, I 
love sharing this marvelous material with them. 



Chapter 1 

The Real and Complex Numbers 



1.1 Definition of the Numbers 1, i, and the square root of 2 1 

In order to make precise sense out of the concepts we study in mathematical analysis, we must first come to 
terms with what the "real numbers" are. Everything in mathematical analysis is based on these numbers, 
and their very definition and existence is quite deep. We will, in fact, not attempt to demonstrate (prove) the 
existence of the real numbers in the body of this text, but will content ourselves with a careful delineation 
of their properties, referring the interested reader to an appendix for the existence and uniqueness proofs. 

Although people may always have had an intuitive idea of what these real numbers were, it was not until 
the nineteenth century that mathematically precise definitions were given. The history of how mathemati- 
cians came to realize the necessity for such precision in their definitions is fascinating from a philosophical 
point of view as much as from a mathematical one. However, we will not pursue the philosophical aspects 
of the subject in this book, but will be content to concentrate our attention just on the mathematical facts. 
These precise definitions are quite complicated, but the powerful possibilities within mathematical analysis 
rely heavily on this precision, so we must pursue them. Toward our primary goals, we will in this chapter 
give definitions of the symbols (numbers) —1, i, and \/2. 

The main points of this chapter are the following: 

1. The notions of least upper bound (supremum) and greatest lower bound (infimum) of a set of 
numbers, 

2. The definition of the real numbersi?, 

3. the formula for the sum of a geometric progression (Theorem 1.9, Geometric Progression, p. 19), 

4. the Binomial Theorem (Theorem 1.10, p. 20), and 

5. the triangle inequality for complex numbers (Theorem 1.15, Triangle Inequality, p. 26). 



1.2 The Natural Numbers and the Integers 2 

We will take for granted that we understand the existence of what we call the natural numbers, i.e., the 
set N whose elements are the numbers 1,2,3,4,.... Indeed, the two salient properties of this set are that 
(a) there is a frist element (the natural number 1), and (b) for each element n of this set there is a "very 
next" one, i.e., an immediate successor. We assume that the algebraic notions of sum and product of natural 
numbers is well-defined and familiar. These operations satisfy three basic relations: 
Basic Algebraic Relations. 

1. (Commutativity) n + m = m + n and n x m = m x n for all n,m G N. 



1 This content is available online at <http://cnx.Org/content/m36082/l.3/>. 
2 This content is available online at <http://cnx.Org/content/m36075/l.2/>. 



6 CHAPTER 1. THE REAL AND COMPLEX NUMBERS 

2. (Associativity) n + (m + k) = (n + m) + k and n x (m x k) = (n x m) x k for all n,m,k € JV. 

3. (Distributivity) n x (m+ k) = n x m + n x k for all n,m,k g AT. 

We also take as given the notion of one natural number being larger than another one. 2 > 1,5 > 3,n+ 1 > n, 
etc. We will accept as true the axiom of mathematical induction, that is, the following statement: 

1.1: 
AXIOM OF MATHEMATICAL INDUCTION. Let S be a subset of the set N of natural 
numbers. Suppose that 

1. leS. 

2. If a natural number k is in S, then the natural number k + 1 also is in S. 

Then S = N. 
That is, every natural number n belongs to S. 

1.2: 

REMARK The axiom of mathematical induction is for our purposes frequently employed as 
a method of proof. That is, if we wish to show that a certain proposition holds for all natural 
numbers, then we let S denote the set of numbers for which the proposition is true, and then, using 
the axiom of mathematical induction, we verify that S is all of N by showing that S satisfies both of 
the above conditions. Mathematical induction can also be used as a method of definition. That is, 
using it, we can define an infinite number of objects {O n } that are indexed by the natural numbers. 
Think of S as the set of natural numbers for which the object O n is defined. We check first to 
see that the object 0\ is defined. We check next that, if the object Ok is defined for a natural 
number k, then there is a prescribed procedure for defining the object Ok+i- So, by the axiom of 
mathematical induction, the object is defined for all natural numbers. This method of defining an 
infinite set of objects is often referred to as si recursive definition, or definition by recursion. 

As an example of recursive definition, let us carefully define exponentiation. 

Definition 1.1: 

Let a be a natural number. We define inductively natural numbers a™ as follows: a 1 = a, and, 
whenever a k is defined, then a k+1 is defined to be a x a*. 

The set S of all natural numbers for which a" is defined is therefore all of N. For, a 1 is defined, and if 
a k is defined there is a prescription for defining a k+1 . This "careful" definition of a n may seem unnecessarily 
detailed. Why not simply define a n as axaxaxa...xan times? The answer is that the ..., though suggestive 
enough, is just not mathematically precise. After all, how would you explain what ... means? The answer to 
that is that you invent a recursive definition to make the intuitive meaning of the ... mathematically precise. 
We will of course use the symbol ... to simplify and shorten our notation, but keep in mind that, if pressed, 
we should be able to provide a careful definition. 

Exercise 1.1 

a. Derive the three laws of exponents for the natural numbers: a n+m = a n x a m . HINT: Fix a 
and m and use the axiom of mathematical induction. a nxm = (a m ) n . HINT: Fix a and m 
and use the axiom of mathematical induction, (a x b) n = a n x b n . HINT: Fix a and b and 
use the axiom of mathematical induction. 

b. Define inductively numbers {Si} as follows: S\ = 1, and if Sk is defined, then Sk+i is defined 
to be Sfc + k+ 1. Prove, by induction, that S n = n (n + 1) /2. Note that we could have defined 
S n using the ... notation by S n = 1 + 2 + 3 + ... + n. 

c. Prove that 

l+4 + 9+16+... + n 2 = V M —-!-. (1.1) 

6 

d. Make a recursive definition of n! = 1 x2x3x ... x n.nl is called n factorial. 



There is a slightly more general statement of the axiom of mathematical induction, which is sometimes of 
use. 

1.3: 

GENERAL AXIOM OF MATHEMATICAL INDUCTION Let S be a subset of the set 
N of natural numbers, and suppose that S satisfies the following conditions 

1. There exists a natural number fc such that k € S. 

2. If S contains a natural number k, then S contains the natural number k + 1. 

Then S contains every natural number n that is larger than or equal to ko. 

From the fundamental set N of natural numbers, we construct the set Z of all integers. First, we simply 
create an additional number called that satisfies the equations + n = n for all n € N and x n = for all 
n € N. The word "create" is, for some mathematicians, a little unsettling. In fact, the idea of zero did not 
appear in mathematics until around the year 900. It is easy to see how the so-called natural numbers came 
by their name. Fingers, toes, trees, fish, etc., can all be counted, and the very concept of counting is what 
the natural numbers are about. On the other hand, one never needed to count zero fingers or fish, so that 
the notion of zero as a number easily could have only come into mathematics at a later time, a time when 
arithmetic was becoming more sophisticated. In any case, from our twenty-first century viewpoint, seems 
very understandable, and we won't belabor the fundamental question of its existence any further here. 

Next, we introduce the so-called negative numbers. This is again quite reasonable from our point of view. 
For every natural number n, we let — n be a number which, when added to n, give 0. Again, the question of 
whether or not such negative numbers exist will not concern us here. We simply create them. 

In short, we will take as given the existence of a set Z, called the integers, which comprises the set N of 
natural numbers, the additional number 0, and the set — N of all negative numbers. We assume that addition 
and multiplication of integers satisfy the three basic algebraic relations of commutativity, associativity, and 
distributivity stated above. We also assume that the following additional relations hold: 

(— n) x ( — k) = n x k, and (— n) x k = n x (— k) = — (n x k) (1.2) 

for all natural numbers n and k. 

1.3 The Rational Numbers 3 

Next, we discuss the set Q of rational numbers, which we ordinarily think of as quotients k/n of integers. Of 
course, we do not allow the "second" element n of the quotient k/n to be 0. Also, we must remember that 
there isn't a 1-1 correspondence between the set Q of all rational numbers and the set of all such quotients 
k/n. Indeed, the two distinct quotients 2/3 and 6/9 represent the same rational number. To be precise, the 
set Q is a collection of equivalence classes of ordered pairs (k, n) of integers, for which the second component 
of the pair is not 0. The equivalence relation among these ordered pairs is this: 

(k,n) = {k ,n) if k x n = n x k . (1.3) 

We will not dwell on this possibly subtle definition, but will rather accept the usual understanding of the 
rational numbers and their arithmetic properties. In particular, we will represent them as quotients rather 
than as ordered pairs, and, if r is a rational number, we will write r = k/n, instead of writing r as the 
equivalence class containing the ordered pair (k,n) . As usual, we refer to the first integer in the quotient 
k/n as the numerator and the second (nonzero) integer in the quotient k/n as the denominator of the 
quotient. The familiar definitions of sum and product for rational numbers are these: 

k k kn + nk , 

-+- = ; 1.4 

n n nn 



3 This content is available online at <http://cnx.Org/content/m36061/l.2/>. 



8 CHAPTER 1. THE REAL AND COMPLEX NUMBERS 

and 

-x- = — . 1.5 

n n nn 

Addition and multiplication of rational numbers satisfy the three basic algebraic relations of commutativity, 
associativity and distributivity stated earlier. 

We note that the integers Z can be identified in an obvious way as a subset of the rational numbers 
Q. Indeed, we identify the integer k with the quotient fc/1. In this way, we note that Q contains the two 
numbers = 0/1 and 1 = 1/1. Notice that any other quotient k/n that is equivalent to 0/1 must satisfy 
k = 0, and any other quotient k/n that is equivalent to 1/1 must satisfy k = n. Remember, k/n = k /n if 
and only if kn = k n. 

The set Q has an additional property not shared by the set of integers Z. It is this: For each nonzero 
element r £ Q, there exists an element r € Q for which rxr' = l. Indeed, if r = k/n ^ 0, then k ^ 0, and 
we may define r = n/k. Consequently, the set Q of all rational numbers is what is known in mathematics 
as a field. 

Definition 1.2: 

A field is a nonempty set F on which there are defined two binary operations, addition (+) and 
multiplication (x), such that the following six axioms hold: 

1. Both addition and multiplication are commutative and associative. 

2. Multiplication is distributive over addition; i.e., 

xx (y + z)=xxy + xxz (1-6) 

for all x,y, z G F. 

3. There exists an element in F, which we will denote by 0, that is an identity for addition; i.e., 
x + = x for all x G F. 

4. There exists a nonzero element in F, which we will denote by 1, that is an identity for 
multiplication; i.e., x x 1 = x for all x € F. 

5. If x € F, then there exists a unique element y e F such that x + y = 0. This element y is 
called the additive inverse of x and is denoted by —a;. 

6. If x € F and x ^ 0, then there exists a unique element y e F such that x x y = 1. This 
element y is called the multiplicative inverse of x and is denoted by x~ l . 

1.4: 

REMARK. There are many examples of fields. (See Exercise 1.2.) They all share certain 
arithmetic properties, which can be derived from the axioms above. If x is an element of a field 
F, then according to one of the axioms above, we have that 1 x x = x. (Note that this "1" is the 
multiplicative identity of the field F and not the natural number 1.) However, it is tempting to 
write x + x = 2xxin the field F. The "2" here is not a priori an element of F, so that the equation 
i + i = 2xiis not really justified. This is an example of a situation where a careful recursive 
definition can be useful. 

Definition 1.3: 

If x is an element of a field F, define inductively elements n ■ x = nx of F by 1 • x = x, and, if k ■ x 
is defined, set (k + 1) ■ x = x + k ■ x. The set S of all natural numbers n for which n ■ x is defined is 
therefore, by the axiom of mathematical induction, all of N. 

Usually we will write nx instead of n ■ x. Of course, nx is just the element of F obtained by adding x to 
itself n times: nx = x + x + x + ... + x. 

Exercise 1.2 

a. Justify for yourself that the set Q of all rational numbers is a field. That is, carefully verify 
that all six of the axioms hold. 



b. Let F-j denote the seven elements {0, 1,2,3, 4, 5, 6}. Define addition and multiplication on F-j 
as ordinary addition and multiplication mod 7. Prove that F? is a field. (You may assume 
that axioms (1) and (2) hold. Check only conditions (3)-(6).) Show in addition that 7x = 
for every x £ F-j. 

c. Let Fg denote the set consisting of the nine elements {0,1,2,3,4,5,6,7,8}. Define addition 
and multiplication on Fg to be ordinary addition and multiplication mod 9. Show that Fg is 
not a field. Which of the axioms fail to hold? 

d. Show that the set N of natural numbers is not a field. Which of the field axioms fail to hold? 
Show that the set Z of all integers is not a field. Which of the field axioms fail to hold? 

Exercise 1.3 

Let F be any field. Verify that the following arithmetic properties hold in F. 

a. x x = for all x £ F. HINT: Use the distributive law and the fact that = + 0. 

b. If x and y are nonzero elements of F, then x x y is nonzero. And, the multiplicative inverse 
of x x y satisfies (x x y) = x^ 1 x y^ 1 . 

c. (— 1) x x = (— x) for all x £ F. 

d. (—x) x (— y) = x x y for all x,y £ F. 

e. x x x — y x y = [x — y) x (x + y) . 

f. (x + y)x(x + y) = xxx + 2-xxy + yxy. 

Definition 1.4: 

Let F be a field, and let a; be a nonzero element of F. 

For each natural number n, we define inductively an element x n in F as follows: x 1 = x, and, 
if x k is defined, set x k+1 = x x x k . Of course, x n is just the product of nx's. 

Define x° to be 1. 

For each natural number n, define x~ n to be the multiplicative inverse (x n )~ of the element 
x n . 

Finally, we define m to be for every positive integer to, and we leave 0~™ and 0° undefined. 

We have therefore defined x m for every nonzero x and every integer m £ Z. 

Exercise 1.4 

Let Fbea field. Derive the following laws of exponents: 

a. x n+m = x n x x m for all nonzero elements x £ F and all integers n and to. HINT: Fix x £ F 
and to £ Z and use induction to derive this law for all natural numbers n. Then use the fact 
that in any field (x x y)~ = i _1 x f '. 

b. x nxm = (x m ) n for all nonzero x £ F and all n,m £ Z. 

c. (x x y) n = x n x y n for all nonzero x,y £ F and all n £ Z. 

From now on, we will indicate multiplication in a field by juxtaposition; i.e., x x y will be denoted simply as 
xy. Also, we will use the standard fractional notation to indicate multiplicative inverses. For instance, 

xy = x— = — . (1-7) 

V V 



1.4 The Real Numbers 4 

What are the real numbers? From a geometric point of view (and a historical one as well) real numbers 
are quantities, i.e., lengths of segments, areas of surfaces, volumes of solids, etc. For example, once we have 



4 This content is available online at <http://cnx.Org/content/m36069/l.2/>. 
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settled on a unit of length, i.e., a segment whose length we call 1, we can, using a compass and straightedge, 
construct segments of any rational length k/n. In some obvious sense then, the rational numbers are real 
numbers. Apparently it was an intellectual shock to the Pythagoreans to discover that there are some other 
real numbers, the so-called irrational ones. Indeed, the square root of 2 is a real number, since we can 
construct a segment the square of whose length is 2 by making a right triangle each of whose legs has length 
1. (By the Pythagorean Theorem of plane geometry, the square of the hypotenuse of this triangle must equal 
2.) And, Pythagoras proved that there is no rational number whose square is 2, thereby establishing that 
there are real numbers tha are not rational. See part (c) of Exercise 1.9. 

Similarly, the area of a circle of radius 1 should be a real number; i.e., 7r should be a real number. It 
wasn't until the late 1800's that Hermite showed that n is not a rational number. One difficulty is that to 
define ir as the area of a circle of radius 1 we must first define what is meant by the " area" of a circle, 
and this turns out to be no easy task. In fact, this naive, geometric approach to the definition of the real 
numbers turns out to be unsatisfactory in the sense that we are not able to prove or derive from these first 
principles certain intuitively obvious arithmetic results. For instance, how can we multiply or divide an area 
by a volume? How can we construct a segment of length the cube root of 2? And, what about negative 
numbers? 

Let us begin by presenting two properties we expect any set that we call the real numbers ought to 
possess. 

Algebraic Properties 

We should be able to add, multiply, divide, etc., real numbers. In short, we require the set of real numbers 
to be a field. 
Positivity Properties 

The second aspect of any set we think of as the real numbers is that it has some notion of direction, some 
notion of positivity. It is this aspect that will allow us to "compare" numbers, e.g., one number is larger than 
another. The mathematically precise way to discuss this notion is the following. 

Definition 1.5: 

A field F is called an ordered field if there exists a subset P C F that satisfies the following two 
properties: 

1. If x, y £ P, then x + y and xy are in P. 

2. If x € F, then one and only one of the following three statements is true. 

i. x e P, 
ii. —x e P, and 
iii. x = 0. (This property is known as the law of tricotomy.) 

The elements of the set P are called positive elements of F, and the elements x for which — x belong to 
P are called negative elements of F. 

As a consequence of these properties of P, we may introduce in F a notion of order. 

Definition 1.6: 

If F is an ordered field, and x and y are elements of F, we say that x < y if y — x & P. We say 
that x < y if either x < y or x = y. 

We say that x > y if y < x, and x > y if y < x. 

An ordered field satisfies the familiar laws of inequalities. They are consequences of the two properties 
of the set P. 

Exercise 1.5 

Using the positivity properties above for an ordered field F, together with the axioms for a field, 
derive the familiar laws of inequalities: 



a. (Transitivity) If x < y and y < z, then x < z. 

b. (Adding like inequalities) If x < y and z < w, then x + z < y + 

c. If x < y and a > 0, then ax < ay. 
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d. If x < y and a < 0, then ay < ax. 

e. If < a < b and < c < d, then ac < bd. 

f. Verify parts (a) through (e) with < replaced by < . 

g. If x and y are elements of F, show that one and only one of the following three relations can 
hold: (i) x < y, (ii) x > y, (iii) x = y. 

h. Suppose x and y are elements of F, and assume that x < y and y < x. Prove that x = y. 

Exercise 1.6 

a. If F is an ordered field, show that 1 GP; i.e., that < 1. HINT: By the law of tricotomy, 
only one of the three possibilities holds for 1. Rule out the last two. 

b. Show that F-j of Exercise 1.2 is not an ordered field; i.e., there is no subset P C F-j such that 
the two positivity properties can hold. HINT: Use part (a) and positivity property (1). 

c. Prove that Q is an ordered field, where the set P is taken to be the usual set of positive 
rational numbers. That is, P consists of those rational numbers a/b for which both a and b 
are natural numbers. 

d. Suppose F is an ordered field and that a; is a nonzero element of F. Show that for all natural 
numbers nnx ^ 0. 

e. (e) Show that, in an ordered field, every nonzero square is positive; i.e., if x / 0, then x 2 € P. 

We remarked earlier that there are many different examples of fields, and many of these are also ordered fields. 
Some fields, though technically different from each other, are really indistinguishable from the algebraic point 
of view, and we make this mathematically precise with the following definition. 

Definition 1.7: 

Let F\ and Fi be two ordered fields, and write P\ and Pi for the set of positive elements in F\ 
and Fi respectively. A 1-1 correspondence J between F\ and Fi is called an isomorphism if 

1. J{x + y) = J (x) + J (y) for &\\ x,y £ Fi. 

2. J(xy) = J(x) J{y) for all x,y € F 1 . 

3. x e Pi if and only if J (x) G Pi. 

1.5: 

REMARK. In general, if A\ and Ai are two algebraic systems, then a 1-1 correspondence 
between A\ and Ai is called an isomorphism if it converts the algebraic structure on A\ into the 
corresponding algebraic structure on Ai. 

Exercise 1.7 

a. Let F be an ordered field. Define a function J : N — » F by J (n) = n ■ 1. Prove that J is an 
isomorphism of N onto a subset N of F. That is, show that this correspondence is one-to-one 
and converts addition and multiplication in TV into addition and multiplication in F. Give an 
example to show that this result is not true if F is merely a field and not an ordered field. 

b. Let F be an ordered field. Define a function J : Q — > F by J(k/n) = k ■ 1 x (n • 1) 
Prove that J is an isomorphism of the ordered field Q onto a subset Q of the ordered field F. 
Conclude that every ordered field F contains a subset that is isomorphic to the ordered field 
Q. 

1.6: 

REMARK. Part (b) of list, p. 11 shows that the ordered field Q is the smallest possible ordered 
field, in the sense that every other ordered field contains an isomorphic copy of Q. However, as 
mentioned earlier, the ordered field Q cannot suffice as the set of real numbers. There is no rational 
number whose square is 2, and we want the square root of 2 to be a real number. See Exercise 1.9 
below. What extra property is there about an ordered field F that will allow us to prove that 
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numbers like \/2,n, and so on are elements of Fl It turns out that the extra property we need is 
related to a quite subtle point concerning upper and lower bounds of sets. It gives us some initial 
indication that the known-to-be subtle concept of a limit may be fundamental to our very notion 
of what the real numbers are. 

Definition 1.8: 

If S is a subset of an ordered field F, then an element x € F is called an upper bound for S if 
x > y for every y € S. An element z is called a lower bound for S if z < y for every y G S. 

A subset S of an ordered field F is called bounded above if it has an upper bound; it is called 
bounded below if it has a lower bound; and it is called bounded if it has both an upper bound and 
a lower bound. 

An element M is called the least upper bound or supremum of a set S if it is an upper bound 
for S and if M < x for every other upper bound x of S. That is, M is less than or equal to any 
other upper bound of S. 

Similarly, an element m is called the greatest lower bound or infimum of S if it is a lower bound 
for S and if z < m for every other lower bound z of S. That is, m is greater than or equal to any 
other lower bound of S. 

Clearly, the supremum and infimum of a set S are unique. For instance, if M and M' are both least 
upper bounds of a set S, then they are both upper bounds of S. We would then have M < M and M < M. 
Therefore, by part (h) of Exercise 1.5, M = M' . 

It is important to keep in mind that an upper bound of a set S need not be an element of S, and in 
particular, the least upper bound of S may or may not actually belong to S. 

If M is the supremum of a set S, we denote M by supS. If m is the infimum of a set S, we denote it by 
infS. 

Exercise 1.8 

a. Suppose S is a nonempty subset of an ordered field F and that x is an element of F. What 
does it mean to say that "x is not an upper bound for 57 

b. Let F be an ordered field, and let S be the empty set, thought of as a subset of F. Prove that 
every element x G F is an upper bound for S and that every element y G F is a lower bound 
for S. HINT: If not, then what? 

c. If S = 0, show that S has no least upper bound and no greatest lower bound. 

1.7: 

REMARK. The preceding exercise shows that peculiar things about upper and lower bounds 
happen when S is the empty set. One point is that just because a set has an upper bound does 
not mean it has to have a least upper bound. That is, no matter which upper bound we choose, 
there is always another one that is strictly smaller. This is a very subtle point, and it is in fact 
quite difficult to give a simple concrete example of this phenomenon. See the remark following 
Theorem 1.6, p. 14. However, part (d) of Exercise 1.9 contains the seed of an example. 

Exercise 1.9 

A natural number a is called even if there exists a natural number c such that o = 2c, and a is 
called odd if there exists a natural number c such that a = 2c + 1. 

a. Prove by induction that every natural number is either odd or even. 

b. Prove that a natural number a is even if and only if a 2 = a x a is even. 

c. Prove that there is no element x of Q whose square is 2. That is, the square root of 2 is not a 
rational number. HINT: Argue by contradiction. Suppose there is a rational number k/n for 
which k 2 /n 2 = 2, and assume, as we may, that the natural numbers k and n have no common 
factor. Observe that k must be even, and then observe that n also must be even. 

d. Let S be the set of all positive rational numbers x for which x 2 = x x x < 2. Prove that S has 
an upper bound and a lower bound. Can you determine whether or not S has a least upper 
bound? 
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The existence of least upper bounds and greatest lower bounds of bounded sets turns out to be the 
critical idea in defining the real numbers. It is precisely the existence of such suprema and infimas 
that enables us to define as real numbers quantities such as \/2,ir,e, and so on. 

Definition 1.9: 

An ordered field F is called complete if every nonempty subset S of F that has an upper bound 
has a least upper bound. 

1.8: 

REMARK. Although Q is an ordered field, we will see that it is not a complete ordered field. 
In fact, the answer to part (d) of Exercise 1.9 is no. The set described there, though bounded 
above, has no least upper bound. In fact, it was one of nineteenth century mathematicians' major 
achievements to prove the following theorem. 

Theorem 1.1: 

There exists a complete ordered field. 

We leave the proof of this theorem to the appendix. 

Perhaps the most reassuring result along these lines is the following companion theorem, whose 
proof we also leave to the appendix. 

Theorem 1.2: 

If Fi and F 2 are two complete ordered fields, then they are isomorphic. 

Taken together, the content of the two preceding theorems is that, up to isomorphism, there exists one 
and only one complete ordered field. For no other reason that that, this special field should be an important 
object in mathematics. Our definition of the real numbers is then the following: 

Definition 1.10: 

By the set R of real numbers we mean the (unique) complete ordered field. 

1.5 Properties of the Real Numbers 5 

Theorem 1.3: 

The set R contains a subset that is isomorphic to the ordered field Q of rational numbers, and 
hence subsets that are isomorphic to N and Z. 

1.9: 
REMARK. The proof of Statement of Theorem 1.3, p. 13 is immediate from part (b) of Exercise 
1.7. In view of this theorem, we will simply think of the natural numbers, the integers, and the 
rational numbers as subsets of the real numbers. 

Having made a definition of the set of real numbers, it is incumbent upon us now to verify that this set 
R satisfies our intuitive notions about the reals. Indeed, we will show that y/2 is an element of R and hence 
is a real number (as plane geometry indicates it should be), and we will show in later chapters that there are 
elements of R that agree with our intuition about e and n. Before we can proceed to these tasks, we must 
establish some special properties of the field R. The first, the next theorem, is simply an analog for lower 
bounds of the least upper bound condition that comes from the completeness property. 

Theorem 1.4: 

If S is a nonempty subset of R that is bounded below, then there exists a greatest lower bound for 
S. 
Proof: 

Define T to be the set of all real numbers x for which —x € S. That is, T is the set — S. We claim 
first that T is bounded above. Thus, let m be a lower bound for the set S, and let us show that the 



5 This content is available online at <http://cnx.Org/content/m36085/l.2/>. 
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number — m is an upper bound for T. If x € T, then —x G S. So, m < —x, implying that —m > x. 
Since this is true for all x e T, the number — m is an upper bound for T. 

Now, by the completeness assumption, T has a least upper bound M> We claim that the number —Mo is 
the greatest lower bound for S. To prove this, we must check two things. First, we must show that —Mo is a 
lower bound for S. Thus, let y be an element of S. Then — y e T, and therefore — y < M . Hence, — M < y, 
showing that —Mo is a lower bound for S. 

Finally, we must show that —Mo is the greatest lower bound for S. Thus, let m be a lower bound for 
S. We saw above that this implies that — m is an upper bound for T. Hence, because Mo is the least upper 
bound for T, we have that — m > Mo, implying that m < —Mo, and this proves that — Mo is the infimum of 
the set S. 

The following is the most basic and frequently used property of least upper bounds. It is our first glimpse 
of " limits." Though the argument is remarkably short and sweet, it will provide the mechanism for many 
of our later proofs, so master this one. 

Theorem 1.5: 

Let S be a nonempty subset of R that is bounded above, and Let Mo denote the least upper bound 
of S; i.e., Mq = supS. Then, for any positive real number e there exists an element t of S such that 
t> M -e. 
Proof: 

Let e > be given. Since Mo — e < Mo, it must be that Mo — e is not an upper bound for S. (Mo 
is necessarily less than or equal to any other upper bound of S.) Therefore, there exists an element 
t € S for which t > Mo — e. This is exactly what the theorem asserts. 

Exercise 1.10 

a. Let S be a nonempty subset of R which is bounded below, and let mo denote the infimum of 
S. Prove that, for every positive 5, there exists an element s of S such that s < mo + S. Mimic 
the proof to Theorem 1.5, p. 14. 

b. Let S be any bounded subset of R, and write —S for the set of negatives of the elements of 
S. Prove that sup(-S) = —infS. 

c. Use part (b) to give an alternate proof of part (a) by using Theorem 1.5, p. 14 and a minus 
sign. 

Exercise 1.11 

a. Let S be the set of all real numbers x for which x < 1. Give an example of an upper bound 
for 5". What is the least upper bound of 5? Is supS an element of 5? 

b. Let S be the set of all x € R for which x 2 < 4. Give an example of an upper bound for S. 
What is the least upper bound of S7 Does supS belong to S? 

We show now that R contains elements other than the rational numbers in Q. Of course this holds for any 
complete ordered field. The next theorem makes this quite explicit. 

Theorem 1.6: 

If x is a positive real number, then there exists a positive real number y such that y 2 = x. That is, 
every positive real number x has a positive square root in R. Moreover, there is only one positive 
square root of x. 
Proof: 

Let S be the set of positive real numbers t for which t 2 < x. Then S is nonempty Indeed, If x > 1, 
then 1 is in S because l 2 = 1 x 1 < 1 x i = i. And, if a; < 1, then x itself is in S, because 
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Also, S is bounded above. In fact, the number 1 + x/2 is an upper bound of S. Indeed, arguing 
by contradiction, suppose there were a t in S such that t > 1 + x/2. Then 

x > t 2 > (l + x/2) 2 = l + x + x 2 /4 > x, (1.8) 

which is a contradiction. Therefore, 1 + x/2 is an upper bound of S, and so S is bounded above. 

Now let y = supS. We wish to show that y 2 = x. We show first that y 2 < x, and then we will 
show that y 2 > x. It will then follow from the tricotomy law that y 2 = x. We prove both these 
inequalities by contradiction. 

So, assume first that y 2 > x, and write a for the positive number y 2 — x. Let e be the positive 
number a/ (2y) , and, using Theorem 1.5, choose a t € S such that t > y — e. Then y + t < (2y) , 
and y — t < e = a/2y. So, 

a = y 2 — x 

= y 2 -t 2 + t 2 -x 

< y 2 -t 2 

= (y + t)(y- 1) 

< 2y(y-t) 

< 2ye 

= a, 

which is a contradiction. Therefore y 2 is not greater than x. 

Now we show that y 2 is not less than x. Again, arguing by contradiction, suppose it is, and let 
e be the positive number x — y 2 . Choose a positive number 5 that is less than y and also less than 
e/ (3y) . Let s = y + 5. Then s is not in S, whence s 2 > x, so that we must have 

e = x — y 2 

= x - s 2 + s 2 - y 2 

< -s 2 -y 2 

= {s + y){s-y) 
(2y + 6) 5 

< 3yS 

< e, 

which again is a contradiction. 

This completes the proof that y 2 = x; i.e., that x has a positive square root. 

Finally, if y were another positive number for which y = x, we show that y = y by ruling out 
the other two cases: y < y and y > y . For instance, if y < y\ then we would have that y 2 < y , 
giving that 

2 ' 2 

x = y < y = x, 

implying that x < x, and this is a contradiction. 

Definition 1.11: 

If x is a positive real number, then the symbol yfx will denote the unique positive number y for 
which y 2 = x. Of course, \/0 denotes the number 0. 

1.10: 

REMARK Part (c) of Exercise 1.9 shows that the field Q contains no number whose square is 
2, and Theorem 1.6, p. 14 shows that the field R does contain a number whose square is 2. We 
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have therefore "proved" that the real numbers is a larger set than the rational numbers. It may 
come as a surprise to learn that we only now have been able to prove that. Look back through the 
chapter to be sure. It follows also that Q itself is not a complete ordered field. If it were, it would 
be isomorphic to R, by Theorem 1.2, so that it would have to contain a square root of 2, which it 
does not. 

Definition 1.12: 

A real number x that is not a rational number, i.e., is not an element of the subset Q of R, is 
called an irrational number. 

Exercise 1.12 

a. Prove that every positive real number has exactly 2 square roots, one positive (\fx) and the 
other negative (—y/x). 

b. Prove that if a; is a negative real number, then there is no real number y such that y 2 = x. 

c. Prove that the product of a nonzero rational number and an arbitrary irrational number 
must be irrational. Show by example that the sum and product of irrational numbers can be 
rational. 



1.6 Intervals and Approximation 6 

We introduce next into the set of real numbers some geometric concepts, namely, a notion of distance between 
numbers. Of course this had to happen, for geometry is the very basis of mathematics. 

Definition 1.13: 

The absolute value of a real number x is denoted by |cc| and is defined as follows: 

1. (i) |0| = 0. 

2. (ii) If x > then \x\ = x. 

3. (iii) If x < (-x > 0) then \x\ = -x. 

We define the distanced (x,y) between two real numbers x and y by d(x,y) = \x — y\. 
Obviously, such definitions of absolute value and distance can be made in any ordered field. 

Exercise 1.13 

Let x and y be real numbers. 

a. Show that |a;| > 0, and that x < \x\. 

b. Prove the Triangle Inequality for absolute values. 

\x + y\ < \x\ + \y\. (1.9) 

HINT: Check the three cases x + y > 0,x + y < 0, and x + y = 0. 

c. Prove the so-called ' ' backward" triangle inequality. 

|x-l/|>IN-|y||. (1.10) 

HINT: Write \x\ = | (x - y) + y\, and use part (b). 

d. Prove that \xy\ = \x\\y\. 

e. Prove that \x\ = vx 2 for all real numbers x. 



6 This content is available online at <http://cnx.Org/content/m36094/l.2/>. 
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f. Prove the Triangle Inequality for the distance function. That is, show that 

d(x, y) < d {x, z) + d (z, y) (1-11) 

for all x,y, z G R. 

Exercise 1.14 

a. Prove that x = y if \x — y\ < e for every positive number e. HINT: Argue by contradiction. 
Suppose x ^ y, and take e = \x — y\/2. 

b. Prove that x = y if and only if x — y < e and y — x < e for every positive e. 

Definition 1.14: 

Let a and b be real numbers for which a < b. By the open interval (a,b) we mean the set of all real 
numbers x for which a < x < b, and by the closed interval [a, b] we mean the set of all real numbers 
x for which a < x < b. 

By (a, oo) we mean the set of all real numbers x for which a < x, and by [a, oo) we mean the set of all 
real numbers x for which a < x. 

Analogously, we define (—00,6) and (—00,6] to be respectively the set of all real numbers x for which 
x < b and the set of all real numbers x for which x < b. 

Exercise 1.15 

a. Show that the intersection of two open intervals either is the empty set or it is again an open 
interval. 

b. Show that (a, b) = (—00, b) n (a, 00) . 

c. Let y be a fixed real number, and let e be a positive number. Show that the inequality 
\x — y\ < e is equivalent to the pair of inequalities 

y — e < x&ndx < y + e; (1-12) 

i.e., show that x satisfies the first inequality if and only if it satisfies the two latter ones. 
Deduce that \x — y\ < e if and only if x is in the open interval (y — e, y + e) . 

Here is one of those assertions that seems like an obvious fact. However, it requires a proof which we only 
now can give, for it depends on the completeness axiom, and in fact is false in some ordered fields. 

Theorem 1.7: 

Let N denote the set of natural Numbers, thought of as a subset of R. Then N is not bounded 
above. 
Proof: 

Suppose false. Let M be an upper bound for the nonempty set N, and let M be the least upper 
bound for N. Taking e to be the positive number 1/2, and applying Theorem 1.5, we have that 
there exists an element k of N such that Mo — 1/2 < k. But then Mo — 1/2 + 1 < k + 1, or, 
M + 1/2 < k + 1. So M < k + 1. But M > k + 1 because M is an upper bound for N. We have 
thus arrived at a contradiction, and the theorem is proved. 

1.11: 
REMARK As mentioned above, there do exist ordered fields F in which the subset Nis bounded 
above. Such fields give rise to what is called "nonstandard analysis," and they were first introduced 
by Abraham Robinson in 1966. The fact that R is a complete ordered field is apparently crucial to 
be able to conclude the intuitively clear fact that the natural numbers have no upper bound. 

Exercise 1.16 presents another intuitively obvious fact, and this one is in some real sense the basis for 
many of our upcoming arguments about limits. It relies on the preceding theorem, is in fact just a corollary, 
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so it has to be considered as a rather deep property of the real numbers; it is not something that works in 
every ordered field. 

Exercise 1.16 

Prove that if e is a positive real number, then there exists a natural number TV such that 1/TV < e. 

Theorem 1.8, p. 18 and Exercise 1.17 show that the set Q of rational numbers is "everywhere dense" in the 
field R. That is, every real number can be approximated arbitrarily closely by rational numbers. Again, we 
point out that this result holds in any complete ordered field, and it is the completeness that is critical. 

Theorem 1.8: 

Let a < b be two real numbers. Then there exists a rational number r = p/q in the open interval 
(a, 6) . In fact, there exist infinitely many rational numbers in the interval (a, b) . 
Proof: 

If a < and b > 0, then taking r = satisfies the first statement of the theorem. Assume first that 
a > and b > a. Let n be a natural number for which 1/n is less than the positive number b — a. 
(Here, we are using the completeness of the field, because we are referring to Theorem 1.7, where 
completeness was vital.) If a = 0, then b = b — a. Setting r = 1/n, we would have that a < r < b. 
So, again, the first part of the theorem would be proved in that case. 

Suppose then that a > 0, and choose the natural number q to be such that \jq is less than 
the minimum of the two positive numbers o and b — a. Now, because the number aq is not an 
upper bound for the set TV, we may let p be the smallest natural number that is larger than aq. Set 
r = p/q. 

We have first that aq < p, implying that a < p/q = r. Also, because p is the smallest natural 
number larger than aq, we must have that p— 1 < aq. Therefore, (p — 1) /q < a, or (p/q) — (l/q) < a, 
implying that r = p/q < a + 1/q < a+ (b — a) = b. Hence, a < r and r < b, and the first statement 
of the theorem is proved when both a and b are nonnegative. 

If both a and b are nonpositive, then both —b and —a are nonnegative, and, using the first part 
of the proof, we can find a rational number r such that —b<r< —a. So, a < — r < b, and the 
first part of the theorem is proved in this case as well. 

Clearly, we may replace b by r and repeat the argument to obtain another rational r\ such that 
a < r\ < r < b. Then, replacing b by r\ and repeating the argument, we get a third rational r-i such 
that a < T2 < r\ < r < b. Continuing this procedure would lead to an infinite number of rationals, 
all between a and b. This proves the second statement of the theorem. 

Exercise 1.17 

a. Let e > be given, and let k be a nonnegative integer. Prove that there exists a rational 
number p/q such that 

he <p/q < 0+ l)e. (1.13) 

b. Let a; be a positive real number and let e be a positive real number. Prove that there exists 
a rational number p/q such that x — e < p/q < x. State and prove an analogous result for 
negative numbers x. 

Exercise 1.18 

a. If a and b are real numbers with a < b, show that there is an irrational number x (not a 
rational number) between o and 6, i.e., with a < x < b. HINT: Apply Theorem 1.8, p. 18 to 
the numbers ay/2 and by/2. 

b. Conclude that within every open interval (a, b) there is a rational number and an irrational 
number. Are there necessarily infinitely many rationals and irrationals in (a, b)l 

The preceding exercise shows the "denseness" of the rationals and the irrationals in the reals. It is essentially 
clear from this that every real number is arbitrarily close to a rational number and an irrational one. 
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1.7 The Geometric Progression and the Binomial Theorem 7 

There are two special algebraic identities that hold in R (in fact in any field F whatsoever) that we emphasize. 
They are both proved by mathematical induction. The first is the formula for the sum of a geometric 
progression. 

Theorem 1.9: Geometric Progression 

Let i be a real number, and let n be a natural number. Then, 

1. If x ± 1, then 

n 1 _ ~n+l 

5> = I^— . (1.14) 



l-x 

3=0 



2. If x = 1, then 



^V = n+1. (1.15) 

3=0 

Proof: 

The second claim is clear, since there are n + 1 summands and each is equal to 1. 

We prove the first claim by induction. Thus, if n = 1, then the assertion is true, since 

i i _ 1 _ 2 

\^x j =x° + x 1 = l + x= (1 + x) - = — . (1.16) 

^— ' 1 — x 1 — X 

3=0 

Now, supposing that the assertion is true for the natural number k, i.e., that 

JL i _ T fc+i 

£l ^- r _, (1.17) 

3=0 

let us show that the assertion holds for the natural number k + 1. Thus 



3=U ^3 = 

l~x k + 1 , fc+1 
1— a: 
l-afc + i+afc + i-^ + a 



(1.18) 



l-x 



which completes the proof. 

The second algebraic formula we wish to emphasize is the Binomial Theorem. Before stating it, we must 
introduce some useful notation. 

Definition 1.15: 

Let n be a natural number. As earlier in this chapter, we define n! as follows: 

n\ = n x (n - 1) x (n - 2) x ... x 2 x 1. (1.19) 

For later notational convenience, we also define 0! to be 1. 

If k is any integer for which < k < n, we define the binomial coefficient (^) by 



nx(n-l)x(H-2)x...x(n-Hl) 



kl k\{n-k)\ k\ 



(1.20) 



7 This content is available online at <http://cnx.Org/content/m36104/l.2/>. 
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Exercise 1.19 

a. Prove that (£) = 1, (™) = n and (™) = 1. 

b. Prove that 

/n\ 2n k 

(k)*mr (L21) 

for all natural numbers n and all integers < k < n. 

c. Prove that 

' n + 1\ /n\ ( n \ , 

1 (1.22) 



k ) \kJ Vfc-1 
for all natural numbers n and all integers 1 < k < n. 

Theorem 1.10: 

If x, y € R and n is a natural number, then 

n 

(x + y) n = Y<( n k ) xk y n ~ k - ( L23 ) 

fe=0 

Proof: 

We shall prove this theorem by induction. If n = 1, then the assertion is true, for (x + y) = x + y 
and 

£ G) xkyi ~ k = (J) xV + G) * v =x+y - (l24) 

Now, assume that the assertion holds for the natural number j; i.e., 



(x + yy = J2( J )x k yi-\ (1.25) 



fc=o v ' 

and let us prove that the assertion holds for the natural number j + 1 . We will make use of part 
(c) of Exercise 1.19. We have that 

{x + yY +1 = (x + y)(x + yY 

(* + y)£Lo(£)*V- fe 

*£i=o (I) * k y j - k + y Eto (i) *V- fe 

Ei= (I) * k+1 y j ~ k + n= (i) -V +1 - fe 

Y:izl{i)x k+1 y j - k + (i)x j+1 y 

+ ELi(i)^V' +1 - fc + ( J )^V +1 

^ +1 +Ei=i( fe i 1 )^v +1 - fc 

+ Ei=i(D^V +1 - fe + y j+1 

^' +1 + ELi ( j f) * k y 3+1 ~ k + v 1+1 
= (£1) ^ + V + Ei=i ( 't 1 ) *V +1 - fc + CJ 1 ) *V +1 
= ' Eito( j f)^V' +1 - fc , 



(1.26) 
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which shows that the assertion of the theorem holds for the natural number j + 1. This completes 
the proof. 

The next exercise is valid in any ordered field, but, since we are mainly interested in the order field R, 
we state everything in terms of that field. 

Exercise 1.20 

a. If x and y are positive real numbers, and if n and k are natural numbers with k < n, show 
that (x + y) n > (l)x k y n - k . 

b. For any positive real number x and natural number n, show that (1 + x) n > 1 + nx. 

c. For any real number x > — 1 and natural number n, prove that (1 + x) > 1 + nx. HINT: 
Do not try to use the binomial theorem as in part (b); it won't work because the terms are 
not all positive; prove this directly by induction. 

There is one more important algebraic identity, which again can be proved by induction. It is actually just 
a corollary of the geometric progression formula. 

Theorem 1.11: 

If x, y G R and n is a natural number, then 

/n-l 

x n - y n = (x - y) \J2 a^'y n - 1 - J '.(1.27) 
\j'=o 

Proof: 

If n = 1 the theorem is clear. Suppose it holds for a natural number k, and let us prove the identity 
for the natural number k + 1. We have 

x k+1 - y k+1 = x k+1 - x k y + x k y - y k+1 

{x-y)x k + y (x k - y k ) 

= {x-y)x k + y(x-y)(J2 k -Zl x:l y k ~ 1 ~ :l ) 

' '' (1.28) 



(a; - y) x k + (x - y) (Ej=o x3 y k 3 
(x - y) (x k y k ~ k + Y!]=l xiy k -i) 

(z-y)(E-= *V- j ) 



, which shows that the assertion holds for the natural number k + 1. So, by induction, the theorem 
is proved. 

Exercise 1.21 

Let x and y be real numbers. 

a. Let n be an odd natural number; i.e., n = 2k + 1 for some natural number k. Show that 

(71-1 
3=0 

HINT: Write x n + y n = x n - {-y) n '. 

b. Show that x 2 + y 2 can not be factored into a product of the form (ax + by) (ex + dy) for any 
choices of real numbers o, b, c, and d. 
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Using the Binomial Theorem together with the preceding theorem, we may now investigate the existence of 
nth roots of real numbers. This next theorem is definitely not valid in any ordered field, for it again depends 
on the completeness property. 

Theorem 1.12: 

Let n be a natural number and let £ be a positive real number. Then there exists a unique positive 
real number y such that y n = x; i.e., x has a unique positive nth root. 
Proof: 

Note first that if < t < s, then t n < s n . (To see this, argue by induction, and use part (e) of 
Exercise 1.5.) Using this, we mimic the proof of Theorem 1.6, p. 14. Thus, let S be the set of 
all positive real numbers t for which t n < x. Then S is nonempty and bounded above. Indeed, if 
x > 1, then leS, while if x < 1, then x itself is in S. Therefore, S is nonempty. Also, using part 
(b) of Exercise 1.20, we see that 1 + (x/n) is an upper bound for S. For, if t > 1 + x/n, then 

t n > (l + {x/n)) n >l + n(x/n)> x. (1.30) 

Now let y = supS, and let us show that y n = x. We rule out the other two possibilities. First, if 
y n > x, let e be the positive number y n — x, and define e to be the positive number ej (ny n ^ 1 ) . 
Then, using Theorem 1.5, p. 14, choose t s S so that y — e < t < y. (Theorem 1.5, p. 14 is where 
the completeness of the ordered field R is crucial.) We have 



< 



y" 


— X 


■t n 


+ t" 


y n 


-t n 



(1.31) 



< (v-i)(£?=oW , - 1 - j 
= (y-^fcCoV- 1 



3=0 

Tl-1 



< e ny 



and this is a contradiction. Therefore, y n is not greater than x. 

Now, if y n < x, let e be the positive number x — y n , and choose a S > such that 5 < 1 and 
5 < e/(y + 1)™ . Then, using the Binomial Theorem, we have that 

(y + s) n = E n k=0 (l)y k s n - k 

= y n + EVo( n k )y k 5 n - k 

= y n + sJ2Vo { n k )y k 5 n - l - k 

< y n + 5J2 n k ^( n k )y k ^- k (132) 

y" + 6(y+l) n 

= x — e + 6(y + 1)" 

< x — e + e 

= x, 

implying that y + 5 & S. But this is a contradiction, since y = supS. Therefore, y n is not less than 
x, and so y n = x. 
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We have shown the existence of a positive nth root of x. To see the uniqueness, suppose y and 
y are two positive nth roots of x. Then 



= y n -y 

= (y-y) (E"=o y ] v 



n ! -n -j-1 (1.33) 

ri-1 i .n-j-1 



which implies that either y — y = or X^=o y 3 y = 0- Since this latter sum consists of 

positive terms, it cannot be 0, whence y = y . This shows that there is but one positive nth root of 
x, and the theorem is proved. 

Exercise 1.22 

a. Show that if n = 2k is an even natural number, then every positive real number has exactly 
two distinct nth roots. 

b. If n = 2k + 1 is an odd natural number, show that every real number has exactly one nth 
root. 

c. If n is a natural number greater than 1, prove that there is no rational number whose nth 
power equals 2, i.e., the nth root of 2 is not a rational number. 



1.8 The Complex Numbers 8 

It is useful to build from the real numbers another number system called the complex numbers. Although 
the real numbers R have many of the properties we expect, i.e., every positive number has a positive square 
root, every number has a cube root, and so on, there are somewhat less prominent properties that R fails to 
possess. For instance, negative numbers do not have square roots. This is actually a property that is missing 
in any ordered field, since every square is positive in an ordered field. See part (e) of Exercise 1.6. One way 
of describing this shortcoming on the part of the real numbers is to note that the equation 1 + x 2 = has 
no solution in the real numbers. Any solution would have to be a number whose square is —1, and no real 
number has that property. As an initial extension of the set of real numbers, why not build a number system 
in which this equation has a solution? 

We faced a similar kind of problem earlier on. In the set N there is no element j such that j + n = n 
for all n € N. That is, there was no element like in the natural numbers. The solution to the problem 
in that case was simply to "create" something called zero, and just adjoin it to our set N. The same kind 
of solution exists for us now. Let us invent an additional number, this time denoted by i, which has the 
property that its square i 2 is — 1. Because the square of any nonzero real number is positive, this new number 
i was traditionally referred to as an "imaginary" number. We simply adjoin this number to the set R, and we 
will then have a number whose square is negative, i.e., —1. Of course, we will require that our new number 
system should still be a field; we don't want to give up our basic algebraic operations. There are several 
implications of this requirement: First of all, if y is any real number, then we must also adjoin to R the 
number y x i = yi, for our new number system should be closed under multiplication. Of course the square 
of iy will equal i 2 y 2 = —y 2 , and therefore this new number iy must also be imaginary, i.e., not a real number. 
Secondly, if x and y are any two real numbers, we must have in our new system a number called x + yi, 
because our new system should be closed under addition. 

Definition 1.16: 

Let i denote an object whose square i 2 = — 1. Let C be the set of all objects that can be represented 
in the form z = x + yi, where both x and y are real numbers. 



s This content is available online at <http://cnx.Org/content/m36113/l.2/>. 
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Define two operations + and x on C as follows: 

(x + yi) + (x + y i) = x + x + (y + y)i, (1-34) 

and 

(x + iy) (x + iy) = xx + xiy + iyx + iyiy = xx — yy + (xy + yx ) i. (1.35) 

Theorem 1.13: 

1. The two operations + and x defined above are commutative and associative, and multiplica- 
tion is distributive over addition. 

2. Each operation has an identity: (0 + Oi) is the identity for addition, and (1 + Oi) is the identity 
for multiplication. 

3. The set C with these operations is a field. 

Proof: 

We leave the proofs of Parts (1) and (2) to the following exercise. To see that C is a field, we need 
to verify one final condition, and that is to show that if z = x + yi ^ = + Oi, then there exists 
a w = u + vi such that zxw = l = l + 0i. Thus, suppose z = x + yi / 0. Then at least one of the 
two real numbers x and y must be nonzero, so that x 2 + y 2 > 0. Define a complex number w by 



x ~y 

W = - T - j + -2— 2*- U-36 

x z + y z x z + y z 



We then have 



zxw = {x + yt) x (_ 3 | F + -^L_ 



_Z2 L I ~ -V 



x -\-y x +y V x 2 +y 



V-, 



& + ^h^i ( L37 ) 



1 + 0* 

i, 

as desired. 

Exercise 1.23 

Prove parts (1) and (2) of Theorem 1.13, p. 24. 

One might think that these kinds of improvements of the real numbers will go on and on. For instance, we 
might next have to create and adjoin another object j so that the number i has a square root; i.e., so that 
the equation i — z 2 = has a solution. Fortunately and surprisingly, this is not necessary, as we will see 
when we finally come to the Fundamental Theorem of Algebra in Theorem 7.7, Fundamental Theorem of 
Algebra, p. 195. 

The subset of C consisting of the pairs x + Oi is a perfect (isomorphic) copy of the real number system 
R. We are justified then in saying that the complex number system extends the real number system, and we 
will say that a real number x is the same as the complex number x + Oi. That is, real numbers are special 
kinds of complex numbers. The complex numbers of the form + yi are called purely imaginary numbers. 
Obviously, the only complex number that is both real and purely imaginary is the number = + Oi. The 
set C can also be regarded as a 2-dimensional space, a plane, and it is also helpful to realize that the complex 
numbers form a 2-dimensional vector space over the field of real numbers. 
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Definition 1.17: 

If z = x + yi, we say that the real number x is the real part of z and write x = 3? (z) . We say that 
the real number y is the imaginary part of z and write y = Q (z) . 

If z = x + yi is a complex number, define the complex conjugated of z by z = x — yi. 

The complex number i satisfies i 2 = — 1, showing that the negative number —1 has a square root in C, 
or equivalently that the equation 1 + z 2 = has a solution in C. We have thus satisfied our initial goal of 
extending the real numbers. But what about other complex numbers? Do they have square roots, cube 
roots, nth roots? What about solutions to other kinds of equations than 1 + z 2 ? 

Exercise 1.24 

a. Prove that every complex number has a square root. HINT: Let z = a + bi. Assume w = x + yi 
satisfies w 2 = z, and just solve the two equations in two unknowns that arise. 

b. Prove that every quadratic equation az 2 + bz + c = 0, for a, b, and c complex numbers, has 
a solution in C. HINT: If a = 0, it is easy to find a solution. If a / 0, we need only find a 
solution to the equivalent equation 



2 




b 




c 


+ 


—z 


+ 


— 






a 




a 



0. (1.38) 

Justify the following algebraic manipulations, and then solve the equation. 



k z+ c = z 2 + b z+ b^_b^ 
a a a Aa z 4a A 



( z + -M - -£- 

\ Z ^ 2a) ia 2 



(1.39) 



What about this new field CI Does every complex number have a cube root, a fourth root, does every 
equation have a solution in CI A natural instinct would be to suspect that C takes care of square roots, 
but that it probably does not necessarily have higher order roots. However, the content of the Fundamental 
Theorem of Algebra, to be proved in Section 7.4, is that every equation of the form P (z) = 0, where P is a 
nonconstant polynomial, has a solution in C. This immediately implies that every complex number c has an 
nth root, for any solution of the equation z n — c = would be an nth root of c. 

The fact that the Fundamental Theorem of Algebra is true is a good indication that the field C is a 
"good" field. But it's not perfect. 

Theorem 1.14: 

In no way can the field C be made into an ordered field. That is, there exists no subset P of C 
that satisfies the two positivity axioms. 
Proof: 

Suppose C were an ordered field, and write P for its set of positive elements. Then, since every 
square in an ordered field must be in P (part (e) of Exercise 1.6), we must have that — 1 = i 2 must 
be in P. But, by part (a) of Exercise 1.6, we also must have that 1 is in P, and this leads to a 
contradiction of the law of tricotomy. We can't have both 1 and —1 in P. Therefore, C is not an 
ordered field. 

Although we may not define when one complex number is smaller than another, we can define the absolute 
value of a complex number and the distance between two of them. 

Definition 1.18: 

If z = x + yi is in C, we define the absolute value of z by 



\z\ = y/x 2 + y 2 . (1.40) 



We define the distanced (z,w) between two complex numbers z and w by 

d (z, w) = \z — w\. 
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If c € C and r > 0, we define the open disk of radius r around c, and denote it by B r (c) , by 

B r {c) = {zeC: \z-c\ <r}. (1.41) 

The closed disk of radius r around c is denoted by B r (c) and is defined by 

B r (c) = {z£ C : \z-c\ <r}. (1.42) 

We also define open and closed punctured disks B r (c) and B r (c) around c by 

Br(c) = {z:0 < |z-c| <r} (1.43) 



and 



B r (c) = {z : < \z-c< r}. (1.44) 



These punctured disks are just like the regular disks, except that they do not contain the central 
point c. 

More generally, if S is any subset of C, we define the open neighborhood of radius r around S, 
denoted by N r (S) , to be the set of all z such that there exists & w e S for which \z — w\ < r. That 
is, N r (S) is the set of all complex numbers that are within a distance of r of the set S. We define 
the closed neighborhood of radius r around S, and denote it by N r (S) , to be the set of all z € C 
for which there exists a w € S such that \z — w\ < r. 

Exercise 1.25 

a. Prove that the absolute value of a complex number z is a nonnegative real number. Show in 
addition that \z\ = zz. 

b. Let x be a real number. Show that the absolute value of x is the same whether we think of 
x as a real number or as a complex number. 

c. Prove that max{\$t{z) |,|9(,z) |) < \z\ < \$l(z) | + \$${z) |. Note that this just amounts to 
verifying that 

max (\x\, \y\) < \] x 2 + y 2 < \x\ + \y\ (1-45) 

for any two real numbers x and y. 



d. For any complex numbers z and w, show that z + w = z + w, and that z = z. 

e. Show that z + z = 23? (z) and z — z = 2i9 (z) . 

f. If z = a + bi and w = a ' + b'i, prove that \zw\ = \z\\w\. HINT: Just compute 

\{a + bi)(a +b'i)\ 2 . 

The next theorem is in a true sense the most often used inequality of mathematical analysis. We have already 
proved the triangle inequality for the absolute value of real numbers, and the proof was not very difficult in 
that case. For complex numbers, it is not at all simple, and this should be taken as a good indication that 
it is a deep result. 

Theorem 1.15: Triangle Inequality 

If z and z are two complex numbers, then 

\z + z'\<\z\ + \z'\ (1.46) 

and 

|*-*'|>IM-|*'||. (1-47) 
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Proof: 

We use the results contained in Exercise 1.25. 

Iz + z'f 



= 


(z + z) (z + z) 


= 


(z + z) h + ~z f ) 


= 


zz + zz + zz' + z z 


= 


2 ' — i — ' 2 

\z\ + zz + zz +\z 


= 


\z\ 2 + 2?H(z'z) + \z'\ 2 


< 


\z\ 2 + 2\R(z'z)\+\z\< 


< 


\z\ 2 + 2\z'z\ + \z'\ 2 


= 


\z\ 2 + 2\z'\\z\ + \z'f 


= 


(\z\ + \z'\) 2 . 



(1.48) 



The Triangle Inequality follows now by taking square roots. 

1.12: 
REMARK The Triangle Inequality is often used in conjunction with what's called the "add and 
subtract trick." Frequently we want to estimate the size of a quantity like \z— w\, and we can often 
accomplish this estimation by adding and subtracting the same thing within the absolute value 
bars: 

\z — w\ = \z — v + v — w\ < \z — v\ + \v — w\. (1-49) 

The point is that we have replaced the estimation problem of the possibly unknown quantity \z — w\ 
by the estimation problems of two other quantities \z — v\ and \v — w\. It is often easier to estimate 
these latter two quantities, usually by an ingenious choice of v of course. 

Exercise 1.26 

a. Prove the second assertion of the preceding theorem. 

b. Prove the Triangle Inequality for the distance function. That is, prove that 

d(z,w) < d{z,v)+d{v,w) (1.50) 

for all z, w, v € C. 

c. Use mathematical induction to prove that 

n n 

lX>l<£l°*l- (1-51) 

It may not be necessary to point out that part (b) of the preceding exercise provides a justification for the 
name "triangle inequality." Indeed, part (b) of that exercise is just the assertion that the length of one side 
of a triangle in the plane is less than or equal to the sum of the lengths of the other two sides. Plot the three 
points z, w, and v, and see that this interpretation is correct. 

Definition 1.19: 

A subset S of C is called Bounded if there exists a real number M such that \z\ < M for every z 
in S. 

Exercise 1.27 
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a. Let S be a subset of C. Let Si be the subset of R consisting of the real parts of the complex 
numbers in S, and let 5*2 be the subset of R consisting of the imaginary parts of the elements 
of S. Prove that S is bounded if and only if S\ and £2 are both bounded. 

HINT: Use Part (c) of Exercise 1.25.. 

b. Let S be the unit circle in the plane, i.e., the set of all complex numbers z = x + iy for which 
\z\ = 1. Compute the sets Si and S2 of part (a). 

Exercise 1.28 

a. Verify that the formulas for the sum of a geometric progression and the binomial theorem 
(Theorem 1.9, Geometric Progression, p. 19 and Theorem 1.10, p. 20) are valid for complex 
numbers z and z . HINT: Check that, as claimed, the proofs of those theorems work in any 
field. 

b. Prove Theorem 1.11, p. 21 for complex numbers z and z . 



Chapter 2 

The Limit of a Sequence of Numbers 



2.1 Definition of the Number e 1 

This chapter contains the beginnings of the most important, and probably the most subtle, notion in math- 
ematical analysis, i.e., the concept of a limit. Though Newton and Leibniz discovered the calculus with its 
tangent lines described as limits of secant lines, and though the Greeks were already estimating areas of 
regions by a kind of limiting process, the precise notion of limit that we use today was not formulated until 
the 19th century by Cauchy and Weierstrass. 

The main results of this chapter are the following: 

1. The definition of the limit of a sequence, 

2. The definition of the real number e (Theorem 2.3, Definition of e., p. 35), 

3. The Squeeze Theorem (Theorem 2.5, Squeeze Theorem, p. 37), 

4. the Bolzano Weierstrass Theorem (Theorem 2.8, Bolzano- Weierstrass, p. 40 and Theorem 2.10, 
p. 45), 

5. The Cauchy Criterion (Theorem 2.9, Cauchy Criterion, p. 43), 

6. the definition of an infinite series, 

7. the Comparison Test (Theorem 2.17, Comparison Test, p. 49), and 

8. the Alternating Series Test (Theorem 2.18, Alternating Series Test, p. 51). 

These are powerful basic results about limits that will serve us well in later chapters. 

2.2 Sequences and Limits 2 

Definition 2.1: 

A sequence of real or complex numbers is defined to be a function from the set N of natural 
numbers into the setR or C. Instead of referring to such a function as an assignment n — ► / (n) , we 
ordinarily use the notation {a„},{a„}f°, or {a\, 02, as, •••}• Here, of course, a n denotes the number 

/(")■ 
2.1: 

REMARK We expand this definition slightly on occasion to make some of our notation more 
indicative. That is, we sometimes index the terms of a sequence beginning with an integer other 
than 1. For example, we write {o„}g°,{ao,ai, ...}, or even {a„}f° 3 . 

We give next what is the most significant definition in the whole of mathematical analysis, i.e., what it 
means for a sequence to converge or to have a limit. 



1 This content is available online at <http://cnx.Org/content/m36117/l.2/>. 
2 This content is available online at <http://cnx.Org/content/m36118/l.2/>. 
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Definition 2.2: 

Let {a n } be a sequence of real numbers and let L be a real number. The sequence {a n } is said 
to converge to L, or that L is the limit of {a n }, if the following condition is satisfied. For every 
positive number e, there exists a natural number N such that if n > N, then \a n — L\ < e. 
In symbols, we say L = lima n or 

L = lira a n . (2-1) 

n — > oo 

We also may write a n i— > L. 

If a sequence {a n } of real or complex numbers converges to a number L, we say that the sequence 
{a n } is convergent. 

We say that a sequence {a n } of real numbers diverges to +oo if for every positive number M, 
there exists a natural number N such that if n > N, then a n > M. Note that we do not say that 
such a sequence is convergent. 

Similarly, we say that a sequence {a n } of real numbers diverges to — oo if for every real number 
M, there exists a natural number TV such that if n > N, then a n < M. 

The definition of convergence for a sequence {z n } of complex numbers is exactly the same as 
for a sequence of real numbers. Thus, let {z n } be a sequence of complex numbers and let L be a 
complex number. The sequence {z n } is said to converge to L, or that L is the limit of {z n }, if the 
following condition is satisfied. For every positive number e, there exists a natural number TV such 
that if n > TV, then \z n — L\ < e. 

2.2: 

REMARKS The natural number TV of the preceding definition surely depends on the positive 
number e. If e is a smaller positive number than e, then the corresponding TV' very likely will need 
to be larger than TV. Sometimes we will indicate this dependence by writing TV (e) instead of simply 
TV. It is always wise to remember that TV depends on s. On the other hand, the TV or TV (e) in this 
definition is not unique. It should be clear that if a natural number TV satisfies this definition, then 
any larger natural number M will also satisfy the definition. So, in fact, if there exists one natural 
number that works, then there exist infinitely many such natural numbers. 

It is clear, too, from the definition that whether or not a sequence is convergent only depends 
on the "tail" of the sequence. Specifically, for any positive integer K, the numbers oi, 02, ..., ax can 
take on any value whatsoever without affecting the convergence of the entire sequence. We are only 
concerned with a n 's for n > TV, and as soon as TV is chosen to be greater than K, the first part of 
the sequence is irrelevant. 

The definition of convergence is given as a fairly complicated sentence, and there are several 
other ways of saying the same thing. Here are two: For every e > 0, there exists a TV such that, 
whenever n > TV,|a n — L\ < e. And, given an e > 0, there exists a TV such that \a n — L\ < e for all 
n for which n > TV. It's a good idea to think about these two sentences and convince yourself that 
they really do "mean" the same thing as the one defining convergence. 

It is clear from this definition that we can't check whether a sequence converges or not unless we 
know the limit value L. The whole thrust of this definition has to do with estimating the quantity 
\a n — L\. We will see later that there are ways to tell in advance that a sequence converges without 
knowing the value of the limit. 

Example 2.1 

Let a n = l/n, and let us show that lima n = 0. Given an e > 0, let us choose a TV such that 
1/TV < e. (How do we know we can find such a TV?) Now, if n > TV, then we have 

K-0| = |"l = -<^<£, (2-2) 

n n TV 

which is exactly what we needed to show to conclude that = lima n . 
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Example 2.2 

Let a n = (2n + 1) / (1 — 3n) , and let L = —2/3. Let us show that L = lima n . Indeed, if e > is 
given, we must find a N, such that if n > N then \a n + (2/3) | < e. Let us examine the quantity 
\a n + 2/3 1 . Maybe we can make some estimates on it, in such a way that it becomes clear how to 
find the natural number N. 

|a„ + (2/3)| = 



1 2n+l i 
1 l-3n " r 


-1 
3 


6n+3+2- 


G/; 


3-9n 




1 5 


1 


1 3-9n 




5 




9n-3 




5 




6n+3n— 


3 


_5_ 

6/1 




1 





(2.3) 



< 

< 

for all n > 1. Therefore, if N is an integer for which N > 1/e, then 

\a n + 2/3| < 1/n < 1/JV < e, (2.4) 

whenever n > JV, as desired. (How do we know that there exists a N which is larger than the 
number 1/e?) 

Example 2.3 

Let a n = 1/y/n, and let us show that lima n = 0. Given an e > 0, we must find an integer N that 
satisfies the requirements of the definition. It's a little trickier this time to choose this N. Consider 
the positive number e 2 . We know, from Exercise 1.16, that there exists a natural number N such 
that 1/N <e 2 . Now, if n > N, then 

\a n -0\ = - i r <-^= = J^<V^=e, (2.5) 

V n VN V N 

which shows that = liml/y/n. 

2.3: 
REMARK A good way to attack a limit problem is to immediately examine the quantity \a n — L\, 
which is what we did in Example 2.2 above. This is the quantity we eventually wish to show is less 
than e when n > N, and determining which N to use is always the hard part. Ordinarily, some 
algebraic manipulations can be performed on the expression \a n — L\ that can help us figure out 
exactly how to choose N. Just know that this process takes some getting used to, so practice! 

Exercise 2.1 

a. Using the basic definition, prove that Um3/ (2n + 7) = 0. 

b. Using the basic definition, prove that liml/n 2 = 0. 

c. Using the basic definition, prove that lira (n 2 + l) / (n 2 + lOOn) = 1. HINT: Use the idea 
from the remark above; i.e., examine the quantity \a n — L\. 

d. Again, using the basic definition, prove that 

n + n 2 i 
Urn 5- = -l- (2.6) 

n — n 2 i 

Remember the definition of the absolute value of a complex number. 
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e. Using the basic definition, prove that 

n 3 + n 2 i . 

Lvm — = i. (2.7) 

1 — n A i 

f. Let a n = (—1)™- Prove that 1 is not the limit of the sequence {a n }. HINT: Suppose the 
sequence {a n } does converge to 1. Use e = 1, let N be the corresponding integer that exists 
in the definition, satisfying \a n — 1| < 1 for all n > N, and then examine the quantity \a n — 1| 
for various n's to get a contradiction. 

Exercise 2.2 

a. Let {a n } be a sequence of (real or complex) numbers, and let L be a number. Prove that 
L = lima n if and only if for every positive integer k there exists an integer N, such that if 
n> N then \a n — L\ < 1/fc. 

b. Let {c„} be a sequence of complex numbers, and suppose that c„ i— > L. If c n = a n + b n i and 
L = a + bi, show that a = lima n and b = limb n . Conversely, if a = lima n and b = limb n , 
show that a + bi = Urn (a n + b n i) . That is, a sequence {c„ = a n + b n i} of complex numbers 
converges if and only if the sequence {a n } of the real parts converges and the sequence {&„} of 
the imaginary parts converges. HINT: You need to show that, given some hypotheses, certain 
quantities are less than e. Part (c) of Exercise 1.25 should be of help. 

Exercise 2.3 

a. Prove that a constant sequence (a n = c) converges to c. 

b. Prove that the sequence { ^l^ 1 } diverges to — oo. 

c. Prove that the sequence {(—1)"} does not converge to any number L. HINT: Argue by con- 
tradiction. Suppose it does converge to a number L. Use e = 1/2, let N be the corresponding 
integer that exists in the definition, and then examine \a n — a n +i\ f° r n > N. Use the following 
useful add and subtract trick: 



ln+l\ 



L + L - a„+i| < \a„ - L\ + \L - a n+1 \. (21 



2.3 Existence of Certain Fundamental Limits 3 

We have, in the preceding exercises, seen that certain specific sequences converge. It's time to develop some 
general theory, something that will apply to lots of sequences, and something that will help us actually 
evaluate limits of certain sequences. 

Definition 2.3: 

A sequence {a n } of real numbers is called nondecreasing if o„ < a n+ i for all n, and it is called 
nonincreasing if a n > a n +i for all n. It is called strictly increasing if a n < a n+ i for all n, and 
strictly decreasing if a n > a„ +1 for all n. 

A sequence {«„} of real numbers is called eventually nondecreasing if there exists a natural 
number N such that a n < o n +i for all n > N, and it is called eventually nonincreasing if there 
exists a natural number N such that a n > a n +i for all n > N. We make analogous definitions of 
"eventually strictly increasing" and "eventually strictly decreasing." 

It is ordinarily very difficult to tell whether a given sequence converges or not; and even if we know in 
theory that a sequence converges, it is still frequently difficult to tell what the limit is. The next theorem is 



3 This content is available online at <http://cnx.Org/content/m36120/l.2/>. 
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therefore very useful. It is also very fundamental, for it makes explicit use of the existence of a least upper 
bound. 

Theorem 2.1: 

Let {a n } be a nondecreasing sequence of real numbers. Suppose that the set S of elements of the 
sequence {a n } is bounded above. Then the sequence {a n } is convergent, and the limit L is given 
byL = supS = supa n . 

Analogously, if {a n } is a nonincreasing sequence that is bounded below, then {a n } converges to 

in fan- 
Proof: 

We prove the first statement. The second is done analogously, and we leave it to an exercise. Write 
L for the supremum supa n . Let e be a positive number. By Theorem 1.5, there exists an integer TV 
such that ajy > L — e, which implies that L — aN < £■ Since {a n } is nondecreasing, we then have 
that a n > aN > L — e for all n > N. Since L is an upper bound for the entire sequence, we know 
that L > a n for every n, and so we have that 

\L — a n \ = L — a n < L — a^r < £ (2-9) 

for all n > TV. This completes the proof of the first assertion. 

Exercise 2.4 

a. Prove the second assertion of the preceding theorem. 

b. Show that Theorem 2.1, p. 33 holds for sequences that are eventually nondecreasing or even- 
tually nonincreasing. (Re-read the remark following the definition of the limit of a sequence.) 

The next exercise again demonstrates the "denseness" of the rational and irrational numbers in the set R of 
all real numbers. 

Exercise 2.5 

a. Let a; be a real number. Prove that there exists a sequence {r„} of rational numbers such that 
x = limr n . In fact, show that the sequence {r n } can be chosen to be nondecreasing. HINT: 
For example, for each n, use Theorem 1.8, p. 18 to choose a rational number r„ between 
x — 1/n and x. 

b. Let a; be a real number. Prove that there exists a sequence {r'„} of irrational numbers such 
that x = limr n . 

c. Let z = x + iy be a complex number. Prove that there exists a sequence {a n } = {/3„ + i^ n } 
of complex numbers that converges to z, such that each (3 n and each 7„ is a rational number. 

Exercise 2.6 

Suppose {a n } and {&„} are two convergent sequences, and suppose that lima n = a and limb n = b. 
Prove that the sequence {a n + b n } is convergent and that 

lira (a n + b n ) = a + b. (2-10) 

HINT: Use an e/2 argument. That is, choose a natural number Nx so that \a n — a\ < e/2 for all 
n > N\, and choose a natural number ./V2 so that \b n — b\ < e/2 for all n > N2- Then let TV be the 
larger of the two numbers TVi and TV2. 

The next theorem establishes the existence of four nontrivial and important limits. This time, the proofs 
are more tricky. Some clever idea will have to be used before we can tell how to choose the TV. 

Theorem 2.2: 

1. Let z g C satisfy \z\ < 1, and define a n = z n . then the sequence {a n } converges to 0. We 
write limz n = 0. 
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2. Let b be a fixed positive number greater than 1, and define a n = b 1 / 71 . See Theorem 1.11, p. 
21. Then lima n = 1. Again, we write limb 1 /"- = 1. 

3. Let b be a positive number less than 1. Then limb 1 / 71 = 1. 

4. If a n = n 1 /™, then lima n = limn 1 / 71 = 1. 

Proof: 

We prove parts (1) and (2) and leave the rest of the proof to the exercise that follows. If z = 0, 
claim (1) is obvious. Assume then that z ^ 0, and let e > be given. Let w = l/\z\, and observe 
that w > 1. So, we may write w = 1 + h for some positive h. (That step is the clever idea for this 
argument.) Then, using the Binomial Theorem, w n > nh, and so l/w n < 1/ (nh) . See part (a) of 
Exercise 1.20. But then 

\z n - 0| = \z n \ = \z\ n = (l/w) n = 1/w 71 < 1/ (nh) • (2.11) 

So, if TV is any natural number larger than 1/ (eh) , then 

\z n -0\ = \z n \ = \z\ n <±-<-!-<e (2.12) 

nh Nh 

for all n > N. This completes the proof of the first assertion of the theorem. 

To see part (2), write a n = b 1 / 71 = l+x n , i.e., x n = b 1 / 71 — 1, and observe first that x n > 0. Indeed, 
since b > 1, it must be that the nth root 6 1 /™ is also > 1. (Why?) Therefore, x n = b 1 / 71 — 1 > 0. 
(Again, writing 6 1 /™ as 1 + x n is the clever idea.) Now, b = b 1 / 71 = (1 + x n ) n , which, again by the 
Binomial Theorem, implies that b > 1 + nx n . So, x n < (b— 1) /n, and therefore 

I6 1 /™ _ i| = 6 i/n _ 1 = Xn< b _ll < e (2.13) 

n 

whenever n > e/ (b — 1) , and this proves part (2). 

Exercise 2.7 

a. Prove part (3) of the preceding theorem. HINT: For b < 1, use the following algebraic 
calculation: 

I&V" _ !| = & i/"|! _ (l/b) 1/n \ < |1 - (l/6) 1/n |, (2.14) 

and then use part (2) as applied to the positive number 1/6. 

b. Prove part (4) of the preceding theorem. Explain why it does not follow directly from part 
(2). HINT: Write n 1 /™ = 1 + h n . Observe that h n > 0. Then use the third term of the 
binomial theorem in the expansion n = (1 + h n ) n . 

c. Construct an alternate proof to part (2) of the preceding theorem as follows: Show that the 
sequence {6 1 /™} is nonincreasing and bounded below by 1. Deduce, from Theorem 2.1, p. 33, 
that the sequence converges to a number L. Now prove that L must be 1. 



2.4 Definition of e 4 

Part (4) of Theorem 2.2, p. 33 raises an interesting point. Suppose we have a sequence {a n }, like {n}, that is 
diverging to infinity, and suppose we have another sequence {b n }, like {1/n}, that is converging to 0. What 
can be said about the sequence {a^ 1 }? The base o„ is blowing up, while the exponent b n is going to 0. In 
other words, there are two competing processes going on. If a n is blowing up, then its powers ought to be 
blowing up as well. On the other hand, anything to the power should be 1, so that, as the exponents of the 



4 This content is available online at <http://cnx.Org/content/m36124/l.2/>. 
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elements of a sequence converge to 0, the sequence ought to converge to 1. This competition between the 
convergence of the base to infinity and the convergence of the exponent to makes it subtle, if not impossibly 
difficult, to tell what the combination does. For the special case of part (4) of Theorem 2.2, p. 33, the answer 
was 1, indicating that, in that case at least, the exponents going to seem to be more important than the 
base going to infinity. One can think up all kinds of such examples: {(2™) },{(rc!) }i{(ro!) }i an d so 
on. We will see later that all sorts of things can happen. 

Of course there is the reverse situation. Suppose {a n } is a sequence of numbers that decreases to 1, and 
suppose {&„} is a sequence of numbers that diverges to infinity. What can we say about the sequence {a n bn }7 
The base is tending to 1, so that one might expect that the whole sequence also would be converging to 1. 
On the other hand the exponents are blowing up, so that one might think that the whole sequence should 
blow up as well. Again, there are lots of examples, and they don't all work the same way. Here is perhaps 
the most famous such example. 

Theorem 2.3: Definition of e. 

For n > 1, define a n = (1 + l/n) n . Then the sequence {a n } is nondecreasing and bounded above, 
whence it is convergent. (We will denote the limit of this special sequence by the letter e.) 
Proof: 

To see that {a n } is nondecreasing, it will suffice to prove that a n+ i/a n > 1 for all n. In the 
computation below, we will use the fact (part (c)of Exercise 1.20) that if x > — 1 then (1 + x) n > 
1 + nx. So, 



rcl- 



jX^ + l 



n+l n+2 n + 1 
n n+l 



1 \ _ n+l n 

n+l / n n+l 



/ n+ l\n+l _ n+1 / n 2 +2 n \" +1 _ n+l/-, 1 \" +1 > n+l ( -i / , i \ / 1 \\_n+l(-, 

\ n ) — n \n 2 +2n+lj ~ n \ l (n+l) 2 / - n I 1 V 1 ^ 1 ) \n+l ) I ~ n \ l 

(2.2) 



as desired. 

We show next that {a n } is bounded above. This time, we use the binomial theorem, the geometric 
progression, and Exercise 1.19. 

rcla n = (1+^)" 

■spn (n\ (l\ k 
, r n (l\k I 2 - 16 ) 

1 i-i 

< 4, 

as desired. 

That the sequence {a n } converges is now a consequence of Theorem 2.1, p. 33. 

2.4: 

REMARK We have now defined the real number e. Its central role in mathematics is not at all 
evident yet; at this point we have no definition of exponential function, logarithm, or trigonometric 
functions. It does follow from the proof above that e is between 2 and 4, and with a little more 
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careful estimates we can show that actually e < 3. For the moment, we will omit any further 
discussion of its precise value. Later, in Exercise 4.19, we will show that it is an irrational number. 

2.5 Properties of Convergent Sequences 5 

Often, our goal is to show that a given sequence is convergent. However, as we study convergent sequences, 
we would like to establish various properties that they have in common. The first theorem of this section is 
just such a result. 

Theorem 2.4: 

Suppose {a n } is a convergent sequence of real or complex numbers. Then the sequence {a n } forms 
a bounded set. 
Proof: 

Write L = lima n . Let e be the positive number 1. Then, there exists a natural number N such that 
\a n — L\ < 1 for all n > N. By the backward triangle inequality, this implies that ||a„| — |L|| < 1 
for all n > N, which implies that \a n \ < \L\ + 1 for all n > N. This shows that at least the tail of 
the sequence is bounded by the constant \L\ + 1. 

Next, let K be a number larger than the finitely many numbers |oi|, ..., |o.jv-i|- Then, for any 
n,|a„| is either less than K or \L\ + 1. Let M be the larger of the two numbers K and \L\ + 1. Then 
\a n \ < M for all n. Hence, the sequence {a n } is bounded. 

Note that the preceding theorem is a partial converse to Theorem 2.1, p. 33; i.e., a convergent sequence 
is necessarily bounded. Of course, not every convergent sequence must be either nondecreasing or nonin- 
creasing, so that a full converse to Theorem 2.1, p. 33 is not true. For instance, take z = —1/2 in part (1) 
of Theorem 2.2, p. 33. It converges to all right, but it is neither nondecreasing nor nonincreasing. 

Exercise 2.8 

a. Suppose {a n } is a sequence of real numbers that converges to a number a, and assume that 
a n > c for all n. Prove that a> c. HINT: Suppose not, and let e be the positive number c— a. 
Let N be a natural number corresponding to this choice of e, and derive a contradiction. 

b. If {a n } is a sequence of real numbers for which lima n = a, and if a / 0, then prove that 
a n / for all large enough n. Show in fact that there exists an TV such that \a n \ > \a\/2 for 
all n> N. HINT: Make use of the positive number e = |o|/2. 

Exercise 2.9 

a. If {a n } is a sequence of positive real numbers for which lima n = a > 0, prove that liniy/a^ = 
y/a. HINT: Multiply the expression ^fa^ — y/a above and below by ^/a^ + y/a. 

b. If {a n } is a sequence of complex numbers, and lima n = a, prove that Zim|a„| = \a\. HINT: 
Use the backward triangle inequality. 

Exercise 2.10 

Suppose {a n } is a sequence of real numbers and that L = lima n . Let Mi and M 2 be real numbers 
such that Mi < a n < M 2 for all n. Prove that Mi < L < M 2 . 

HINT: Suppose, for instance, that L > M 2 - Make use of the positive number L — M 2 to derive 
a contradiction. 

We are often able to show that a sequence converges by comparing it to another sequence that we already 
know converges. The following exercise demonstrates some of these techniques. 

Exercise 2.11 

Let {a n } be a sequence of complex numbers. 



5 This content is available online at <http://cnx.Org/content/m36126/l.2/>. 
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a. Suppose that, for each n,|a„| < 1/n. Prove that = lima n . 

b. Suppose {&„} is a sequence that converges to 0, and suppose that, for each n,|a n | < \b n \. 
Prove that = lima n . 

The next result is perhaps the most powerful technique we have for showing that a given sequence converges 
to a given number. 

Theorem 2.5: Squeeze Theorem 

Suppose that {a n } is a sequence of real numbers and that {b n } and {c„} are two sequences of real 
numbers for which b n < a n < c„ for all n. Suppose further that limb n = limc n = L. Then the 
sequence {a n } also converges to L. 
Proof: 

We examine the quantity \a n — L, | employ some add and subtract tricks, and make the following 
computations: 

rcl\a n -L\ < \a n -b n + b n -L\ 

< \a„ - b„\ + \b n - L\ 

a n -b n + \b n - L\ 

< c n -b n + \b„ - L\ 
= \c n ~ b n \ + \b n - L\ 

< \c n - L\ + \L - b n \ + \b n - L\. 

So, we can make \a n — L\ < e by making \c n — L\ < e/3 and \b n — L\ < e/3. So, let iVi be a positive 
integer such that \c n — L\ < e/3 if n > Ni, and let N 2 be a positive integer so that \b n — L\ < e/3 if 
n > N 2 . Then set N = max (Ni,N 2 ) ■ Clearly, if n > N, then both inequalities \c n — L\ < e/3 and 
\b n — L\ < e/3, and hence \a n — L\ < e. This finishes the proof. 

The next result establishes what are frequently called the "limit theorems." Basically, these results show 
how convergence interacts with algebraic operations. 

Theorem 2.6: 

Let {a n } and {&„} be two sequences of complex numbers with a = lima n and b = limb n . Then 

1. The sequence {a n + b n } converges, and 

lim (a n + b n ) = lima n + limb n = a + b. (2-18) 

2. The sequence {a n b n } is convergent, and 

lim(a n b n ) = lima n limb n = ab. (2-19) 

3. If all the 6„'s as well as b are nonzero, then the sequence {a n /b n } is convergent, and 

fa n lima n a 
hm — = — — - = -.(2.20) 
\ b n tvmb n b 

Proof: 

Part (1) is exactly the same as Exercise 2.6. Let us prove part (2). 

By Theorem 2.4, p. 36, both sequences {a n } and {b n } are bounded. Therefore, let M be a 
number such that \a n \ < M and \b n \ < M for all n. Now, let e > be given. There exists an Ni 
such that \a n — a\ < ej (2M) whenever n > N\, and there exists an ^2 such that \b n — b\ < e/ (2M) 
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whenever n > TV2. Let N be the maximum of N\ and TV2. Here comes the add and subtract trick 
again. 

rcl\a n b n — ab\ = \a n b n — ab n + ab n — ab\ 

< \a n b n - ab n \ + \ab n - ab\ 

= \a n - a\\b n \ + \a\\b- b n \ (2-21) 

< \a n - a\M + M\b n - b\ 

< e 

if n > N, which shows that lira (a n b n ) = ab. 

To prove part (3), let M be as in the previous paragraph, and let e > be given. There exists 
an 7V"i such that \a n — a\ < (e\b\ 2 ) / {AM) whenever n > N\; there also exists an N2 such that 
\b n — b\ < (s\b\ 2 ) / (AM) whenever n > N2; and there exists an iV3 such that \b n \ > \b\/2 whenever 
n > -^3- (See Exercise 2.8.) Let TV be the maximum of the three numbers Ni, N2 and 7V3. Then: 

rrl\ a ™ a \ — I anb-b n a | 

lci \b n fo I — I b n b I 

= \a n b-b n a\Tj± 



\b„b\ 
< \a„b - b n a\j^-^ 



(2.22) 
2 v ; 



< (\a„ - a\\b\ + \a\\b n - b\) |fc|2 

< (M\a n - a\ + M\b n - b\) -ff 

< e 
if n > TV. This completes the proof. 

2.5: 

REMARK The proof of part (3) of the preceding theorem may look mysterious. Where, for 
instance, does this number e|6| /AM come from? The answer is that one begins such a proof by 
examining the quantity \a n /b n — a/b\ to see if by some algebraic manipulation one can discover how 
to control its size by using the quantities \a n — a\ and \b n — b\. The assumption that a = lima n and 
b = limb n mean exactly that the quantities \a n — a\ and \b n — b\ can be controlled by requiring n 
to be large enough. The algebraic computation in the proof above shows that 



<{M\a n -a\ + M\b n -b\)^, (2.23) 



\°^ - a -\< {M\a n - a\ + M\b n -b\) ^- 

On |fr| 

and one can then see exactly how small to make \a n — a\ and \b n — b\ so that \a n /b n — a/b\ < e. 
Indeed, this is the way most limit proofs work. 

Exercise 2.12 

If possible, determine the limits of the following sequences by using Theorem 2.2, p. 33, Theo- 
rem 2.3, Definition of e., p. 35, Theorem 2.6, p. 37, and the squeeze theorem Theorem 2.5, Squeeze 
Theorem, p. 37. 

a. {n 1 /" 2 }. 



b. 


{(- 2 ) 17 "}. 


c. 


{(1 + n) 1 /"}. 


d. 


{(1 + - 2 ) 17 " 3 } 


0. 


{(1 + 1/n) 2 /"} 


f. 


{(1 + 1/n) 2 "}. 
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g- {(1 + 1/n)"}. 

h. {(1 - 1/n)"}. HINT: Note that 



n- 1 1 



1 " V" = — - = ^ = ^T+T = rT-T-- ( 2 - 24 ) 



n-l n-1 ' n-1 



i. {(l-l/(2n)) 3 "}. 
J- {(n!) 1/n }- 



2.6 Subsequences and Cluster Points 6 

Definition 2.4: 

Let {a n } be a sequence of real or complex numbers. A subsequence of {a n } is a sequence {bk} that 
is determined by the sequence {a n } together with a strictly increasing sequence {n^} of natural 
numbers. The sequence {bk} is defined by bk = a nk . That is, the fcth term of the sequence {bk} is 
the rifcth term of the original sequence {a n }. 

Exercise 2.13 

Prove that a subsequence of a subsequence of {a n } is itself a subsequence of {a„}. Thus, let {a n } 
be a sequence of numbers, and let {bk} = {a„J be a subsequence of {a n }. Suppose {cj} = {bk } is 
a subsequence of the sequence {bk}- Prove that {cj} is a subsequence of {a n }. What is the strictly 
increasing sequence {rrij} of natural numbers for which Cj = a m ? 

Here is an interesting generalization of the notion of the limit of a sequence. 

Definition 2.5: 

Let {a n } be a sequence of real or complex numbers. A number x is called a cluster point of the 
sequence {a n } if there exists a subsequence {bk} of {a n } such that x = limbk- The set of all cluster 
points of a sequence {a n } is called the cluster set of the sequence. 

Exercise 2.14 

a. Give an example of a sequence whose cluster set contains two points. Give an example of 
a sequence whose cluster set contains exactly n points. Can you think of a sequence whose 
cluster set is infinite? 

b. Let {a n } be a sequence with cluster set S. What is the cluster set for the sequence {— a n }l 
What is the cluster set for the sequence {a\}l 

c. If {&„} is a sequence for which b = limb n , and {a n } is another sequence, what is the cluster 
set of the sequence {a n b n }l 

d. Give an example of a sequence whose cluster set is empty. 

e. Show that if the sequence {a n } is bounded above, then the cluster set S is bounded above. 
Show also that if {a n } is bounded below, then S is bounded below. 

f. Give an example of a sequence whose cluster set S is bounded above but not bounded below. 

g. Give an example of a sequence that is not bounded, and which has exactly one cluster point. 

Theorem 2.7: 

Suppose {a n } is a sequence of real or complex numbers. 

1. (Uniqueness of limits) Suppose lima n = L, and lima n = M. Then L = M. That is, if the 
limit of a sequence exists, it is unique. 



6 This content is available online at <http://cnx.Org/content/m36129/l.2/>. 
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2. If L = lima n , and if {bk} is a subsequence of {a n }, then the sequence {bk} is convergent, 
and limbk = L. That is, if a sequence has a limit, then every subsequence is convergent and 
converges to that same limit. 

Proof: 

Suppose lima n = L&ndlima n = M. Let e be a positive number, and choose N\ so that \a n — L\ < e/2 
if n > Ni, and choose N 2 so that \a n — M\ < e/2 if n > N 2 - Choose an n larger than both A^andA^. 
Then 

\L - M\ = \L - a n + a n - M\ < \L - a n \ + \a n - M\ < e. (2.25) 

Therefore, since \L — M\ < e for every positive e, it follows that L — M = or L = M. This proves 
part (1). 

Next, suppose lima n = L and let {bk} be a subsequence of {a n }. We wish to show that limbk = 
L. Let e > be given, and choose an AT such that \a n — L\ < e if n > N. Choose a if so that 
n-K > AT. (How?) Then, if k > K, we have rik > uk > N, whence \bk — L\ = \a nk — L\ < e, which 
shows that limbk = L. This proves part (2). 

2.6: 

REMARK The preceding theorem has the following interpretation. It says that if a sequence 
converges to a number L, then the cluster set of the sequence contains only one number, and that 
number is L. Indeed, if a; is a cluster point of the sequence, then there must be some subsequence 
that converges to x. But, by part (2), every subsequence converges to L. Then, by part (1), x = L. 
Part (g) of Exercise 2.14 shows that the converse of this theorem is not valid, that is, the cluster 
set may contain only one point, and yet the sequence is not convergent. 

We give next what is probably the most useful fundamental result about sequences, the Bolzano- 
Weierstrass Theorem. It is this theorem that will enable us to derive many of the important properties 
of continuity, differentiability, and integrability. 

Theorem 2.8: Bolzano- Weierstrass 

Every bounded sequence {a n } of real or complex numbers has a cluster point. In other words, 
every bounded sequence has a convergent subsequence. 

The Bolzano- Weierstrass Theorem is, perhaps not surprisingly, a very difficult theorem to prove. 
We begin with a technical, but very helpful, lemma. 

Lemma 2.1: 

Let {a n } be a bounded sequence of real numbers; i.e., assume that there exists an M such that 
\o-n\ < M for all n. For each n > 1, let S n be the set whose elements are {a n , o n +i, a n+ 2, ■■■}■ That 
is, S n is just the elements of the tail of the sequence from n on. Define x n = supS n = sup k>n ak- 
Then 

1. The sequence {x n } is bounded (above and below). 

2. The sequence {x n } is non-increasing. 

3. The sequence {x n } converges to a number x. 

4. The limit x of the sequence {x n } is a cluster point of the sequence {a n }. That is, there exists 
a subsequence {bk} of the sequence {a n } that converges to x. 

5. If y is any cluster point of the sequence {a n }, then y < x, where x is the cluster point of part 
(4). That is, x is the maximum of all cluster points of the sequence {a n }. 

Proof: 

Since x n is the supremum of the set S n , and since each element of that set is bounded between 
— M and M, part (1) is immediate. 



41 

Since S n +i C S n , it is clear that 

z n+ i = supS n+ i < swpS„ = x n , (2.26) 

showing part (2). 

The fact that the sequence {x n } converges to a number x is then a consequence of Theorem 2.1, 
p. 33. 

We have to show that the limit x of the sequence {x n } is a cluster point of {a„}. Notice that 
{x n } may not itself be a subsequence of {a„}, each x n may or may not be one of the numbers 
Ofe, so that there really is something to prove. In fact, this is the hard part of this lemma. To 
finish the proof of part (4), we must define an increasing sequence {rik} of natural numbers for 
which the corresponding subsequence {&&} = {a nk } of {a n } converges to x. We will choose these 
natural numbers {nu] so that \x — a nk \ < 1/fc. Once we have accomplished this, the fact that the 
corresponding subsequence {a nk } converges to x will be clear. We choose the rifc's inductively. 
First, using the fact that x = limx n , choose an n so that \x n — x\ = x n — x < 1/1. Then, because 
x n = supS n , we may choose by Theorem 1.5, p. 14 some m > n such that x n > a m > x n — 1/1. 
But then \a m — x\ < 1/1. (Why?) This m we call n\. We have that \a ni — x\ < 1/1. 

Next, again using the fact that x = limx n , choose another n so that n > m and so that 
\x n — x\ = x n — x < 1/2. Then, since this x n = supS n , we may choose another m > n such that 
x n > a m > x n — 1/2- This m we call n 2 . Note that we have |a„ 2 — x\ < 1/2. 

Arguing by induction, if we have found an increasing set ri\ < n 2 < ... < rij, for which \a n . —x\ < 
1/i for 1 < i < j, choose an n larger than rij such that \x n — x\ < 1/ (j + 1) . Then, since x n = supS n , 
choose an m > n so that x n > a m > x n — 1/ (j + 1) . Then \a m — x\ < 1/ (j + 1), and we let nj+i 
be this to. It follows that |a n+1 — x\ < 1/ (j + 1) . 

So, by recursive definition, we have constructed a subsequence of {a n } that converges to x, and 
this completes the proof of part (4) of the lemma. 

Finally, if y is any cluster point of {a„}, and if y = lima nh , then n^ > fc, and so a nk < Xk, 
implying that Xk — a„ t > 0. Hence, taking limits on fc, we see that x — y > 0, and this proves part 
(5). 

Now, using the lemma, we can give the proof of the Bolzano- Weierstrass Theorem. 

Proof: 

If {a n } is a sequence of real numbers, this theorem is an immediate consequence of part (4) of the 
preceding lemma. 

If a n = b n + c n i is a sequence of complex numbers, and if {a n } is bounded, then {&„} and {c„} 
are both bounded sequences of real numbers. See Exercise 1.27. So, by the preceding paragraph, 
there exists a subsequence {b nk } of {&„} that converges to a real number b. Now, the subsequence 
{c„ fc } is itself a bounded sequence of real numbers, so there is a subsequence {c nk } that converges 
to a real number c. By part (2) of Theorem 2.7, p. 39, we also have that the subsequence {b nk } 
converges to b. So the subsequence {a nk } = {b nk +c nk i) of {a n } converges to the complex number 
b + ci; i.e., {a„} has a cluster point. This completes the proof. 

There is an important result that is analogous to the Lemma above, and its proof is easily adapted from 
the proof of that lemma. 

Exercise 2.15 

Let {a n } be a bounded sequence of real numbers. Define a sequence {y n } by y n = in/k^na^- Prove 
that: 

a. {y n } is nondecreasing and bounded above. 

b. y = limy n is a cluster point of {a n }. 
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c. If z is any cluster point of {a n }, then y < z. That is, y is the minimum of all the cluster 
points of the sequence {a n }. HINT: Let {a n } = {— a n }, and apply the preceding lemma to 
{a„). This exercise will then follow from that. 

The Bolzano- Wierstrass Theorem shows that the cluster set of a bounded sequence {a n } is nonempty. It is 
also a bounded set itself. 

The following definition is only for sequences of real numbers. However, like the Bolzano- Weierstrass 
Theorem, it is of very basic importance and will be used several times in the sequel. 

Definition 2.6: 

Let {a n } be a sequence of real numbers and let S denote its cluster set. 

If S is nonempty and bounded above, we define lim supa n to be the supremum supS of S. 

If S is nonempty and bounded below, we define lim infa n to be the infimum infS of S. 

If the sequence {a n } of real numbers is not bounded above, we define lim supa n to be oo, and 
if {a n } is not bounded below, we define lim infa n to be — oo. 

If {«n} diverges to oo, then we define lim supa n and lim infa n both to be oo. And, if {a n } 
diverges to — oo, we define lim supa n and lim infa n both to be — oo. 

We call lim supa n the limit superior of the sequence {a n }, and lim infa n the limit inferior of 
{a n }- 
Exercise 2.16 

a. Suppose {a n } is a bounded sequence of real numbers. Prove that the sequence {x n } of the 
lemma following Theorem 2.8, Bolzano- Weierstrass, p. 40 converges to lim supa n . Show also 
that the sequence {y n } of Exercise 2.15 converges to lim infa n . 

b. Let {a n } be a not necessarily bounded sequence of real numbers. Prove that 

lim supa n = infsupak = limsupak- (2.27) 

n k>n n k>n 

and 

lim infa n = supinfa^ = liminfk > na^. (2.28) 

n k>n n 

HINT: Check all cases, and use Lemma 2.1, p. 40 and Exercise 2.15. 

c. Let {a n } be a sequence of real numbers. Prove that 

lim supa n = —lim inf (— a n ) . (2.29) 

d. Give examples to show that all four of the following possibilities can happen. 

a. lim supa n is finite, and lim infa n = — oo. 

b. lim supa n = oo and lim infa n is finite. 

c. lim supa n = oo and lim infa n = — oo. 

d. both lim supa n and lim infa n are finite. 

The notions of limsup and liminf are perhaps mysterious, and they are in fact difficult to grasp. The 
previous exercise describes them as the resultof a kind of two-level process, and there are occasions when 
this description is a great help. However, the limsup and liminf can also be characterized in other ways that 
are more reminiscent of the definition of a limit. These other ways are indicated in the next exercise. 

Exercise 2.17 

Let {a n } be a bounded sequence of real numbers with 

lim supa n = L and lim infa n = I. Prove that L and I satisfy the following properties. 

a. For each e > 0, there exists an N such that a n < L + e for all n > N. HINT: Use the fact 
that lim supa n = L is the number x of the lemma following Theorem 2.8, and that x is the 
limit of a specific sequence {x n }. 
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b. For each e > 0, and any natural number k, there exists a natural number j > k such that 
cij > L — e. Same hint as for part (a). 

c. For each e > 0, there exists an N such that a n > I — e for all n> N. 

d. For each e > 0, and any natural number k, there exists a natural number j > k such that 
dj < I + s. 

e. Suppose L' is a number that satisfies parts (a) and (b). Prove that L' is the limsup of {a n }. 
HINT: Use part (a) to show that L' is greater than or equal to every cluster point of {a n }. 
Then use part (b) to show that L is less than or equal to some cluster point. 

f. If l' is any number that satisfies parts (c) and (d), show that l' is the liminf of the sequence 
{a n }- 

Exercise 2.18 

a. Let {a n } and {b n } be two bounded sequences of real numbers, and write L = lira supa n 
and M = lim supb n . Prove that Urn sup(a n + b n ) < Urn supa n + lim supb n . HINT: Using 
part (a) of the preceding exercise, show that for every e > there exists a N such that 
a n + b n < L + M + e for all n > N, and conclude from this that every cluster point y 
of the sequence {a n + b n } is less than or equal to L + M. This will finish the proof, since 
lim sup (a n + b n ) is a cluster point of that sequence. 

b. Again, let {a n } and {&„} be two bounded sequences of real numbers, and write I = lim infa n 
and m = lim infb n . Prove that lim inf (a n + b n ) > lim infa n + lim infb n . HINT: Use part 
(c) of the previous exercise. 

c. Find examples of sequences {a n } and {b n } for which lim supa n = lim supb„ = 1, but 
lim sup (a n + b n ) = 0. 

We introduce next another property that a sequence can possess. It looks very like the definition of a 
convergent sequence, but it differs in a crucial way, and that is that this definition only concerns the elements 
of the sequence {o„} and not the limit L. 

Definition 2.7: 

A sequence {a n } of real or complex numbers is a Cauchy sequence if for every e > 0, there exists 
a natural number N such that if n > N and m > N then \a n — a m \ < e. 

2.7: 

REMARK No doubt, this definition has something to do with limits. Any time there is a positive 
e and an N, we must be near some kind of limit notion. The point of the definition of a Cauchy 
sequence is that there is no explicit mention of what the limit is. It isn't that the terms of the 
sequence are getting closer and closer to some number L, it's that the terms of the sequence are 
getting closer and closer to each other. This subtle difference is worth some thought. 

Exercise 2.19 

Prove that a Cauchy sequence is bounded. (Try to adjust the proof of Theorem 2.4, p. 36 to work 
for this situation.) 

The next theorem, like the Bolzano- Weierstrass Theorem, seems to be quite abstract, but it also turns 
out to be a very useful tool for proving theorems about continity, differentiability, etc. In the proof, the 
completeness of the set of real numbers will be crucial. This theorem is not true in ordered fields that are 
not complete. 

Theorem 2.9: Cauchy Criterion 

A sequence {a n } of real or complex numbers is convergent if and only if it is a Cauchy sequence. 
Proof: 

If lima n = a then given e > 0, choose N so that |ofc — a\ < e/2 if k > N. From the triangle 
inequality, and by adding and subtracting a, we obtain that \a n — a m \ < e if n > N and m > N. 
Hence, if {a n } is convergent, then {a n } is a Cauchy sequence. 
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Conversely, if {a n } is a cauchy sequence, then {a n } is bounded by the previous exercise. Now 
we use the fact that {a n } is a sequence of real or complex numbers. Let a; be a cluster point of 
{a n }. We know that one exists by the Bolzano- Weierstrass Theorem. Let us show that in fact this 
number x not only is a cluster point but that it is in fact the limit of the sequence {a n }. Given 
£ > 0, choose JVso that \a n — a m \ < e/2 whenever both n and m> N. Let {a nk } be a subsequence 
of {a n } that converges to x. Because {n^} is strictly increasing, we may choose a A; so that n^ > N 
and also so that \a nk — x\ < e/2. Then, if n > N, then both n and this particular rik are larger 
than or equal to N. Therefore, \a n — x\ < \a n — a nk | + \a nk — x\ < e. this completes the proof that 
x = lima„. 



2.7 A Little Topology 7 

We now investigate some properties that subsets of R and C may possess. We will define "closed sets," "open 
sets," and "limit points" of sets. These notions are the rudimentary notions of what is called topology. As 
in earlier definitions, these topological ones will be enlightening when we come to continuity. 

Definition 2.8: 

Let S be a subset of C. A complex number x is called a limit point of S if there exists a sequence 
{x n } of elements of S such that x = limx n . 

A set S C C is called closed if every limit point of S belongs to S. 

Every limit point of a set of real numbers is a real number. Closed intervals [a, b] are examples of closed 
sets in R, while open intervals and half-open intervals may not be closed sets. Similarly, closed disks B r (c) 
of radius r around a point c in C, and closed neighborhoods N r (S) of radius r around a set S C C, are 
closed sets, while the open disks or open neighborhoods are not closed sets. As a first example of a limit 
point of a set, we give the following exercise. 

Exercise 2.20 

Let S be a nonempty bounded set of real numbers, and let M = supS. Prove that there exists 
a sequence {a n } of elements of S such that M = lima n . That is, prove that the supremum of a 
bounded set of real numbers is a limit point of that set. State and prove an analogous result for 
infs. 

HINT: Use Theorem 1.5, p. 14, and let e run through the numbers 1/n. 

Exercise 2.21 

a. Suppose S is a set of real numbers, and that z = a + bi € C with 6/0. Show that z is not a 
limit point of S. That is, every limit point of a set of real numbers is a real number. HINT: 
Suppose false; write a + bi = limx n , and make use of the positive number \b\. 

b. Let c be a complex number, and let S = B r (c) be the set of all z € C for which \z — c\ < r. 
Show that S is a closed subset of C. HINT: Use part (b) of Exercise 2.9. 

c. Show that the open disk B r (0) is not a closed set in C by finding a limit point of B r (0) that 
is not in B r (0) . 

d. State and prove results analogous to parts b and c for intervals in R. 

e. Show that every element a; of a set S is a limit point of S. 

f. Let S be a subset of C, and let x be a complex number. Show that x is not a limit point of 
S if and only if there exists a positive number e such that if \y — x\ < e, then y is not in S. 
That is, S C\B e (x) = 0. HINT: To prove the " only if" part, argue by contradiction, and use 
the sequence {1/n} as e's. 

g. Let {a n } be a sequence of complex numbers, and let S be the set of all the o n 's. What is the 
difference between a cluster point of the sequence {a„} and a limit point of the set S? 



7 This content is available online at <http://cnx.Org/content/m36157/l.2/>. 
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h. (h) Prove that the cluster set of a sequence is a closed set. HINT: Use parts (e) and (f). 

Exercise 2.22 

a. Show that the set Q of all rational numbers is not a closed set. Show also that the set of all 
irrational numbers is not a closed set. 

b. Show that if S is a closed subset of R that contains Q, then S must equal all of R. 

Here is another version of the Bolzano- Weierstrass Theorem, this time stated in terms of closed sets rather 
than bounded sequences. 

Theorem 2.10: 

Let S be a bounded and closed subset of C. Then every sequence {x n } of elements of S has a 
subsequence that converges to an element of S. 
Proof: 

Let {xn} be a sequence in S. Since S is bounded, we know by Theorem 2.8, Bolzano- Weierstrass, 
p. 40 that there exists a subsequence {x nk } of {x n } that converges to some number x. Since each 
x nk belongs to S, it follows that x is a limit point of S. Finally, because S is a closed subset of C, 
it then follows that x e S. 

We have defined the concept of a closed set. Now let's give the definition of an open set. 

Definition 2.9: 

Let S be a subset of C. A point x € S is called an interior point of S if there exists an e > such 
that the open disk B e (x) of radius e around x is entirely contained in S. The set of all interior 
points of S is denoted by 5° and we call 5° the interior of S. 

A subset S of C is called an open subset of C if every point of S is an interior point of 5; i.e., 
if S = S°. 

Analogously, let S be a subset of R. A point x € S is called an interior point of S if there exists 
an e > such that the open interval (x — e, x + e) is entirely contained in S. Again, we denote the 
set of all interior points of S by S° and call S° the interior of S. 

A subset S of R is called an open subset of R if every point of S is an interior point of S; i.e., 
if S = S°. 
Exercise 2.23 

a. Prove that an open interval (a,b) in R is an open subset of R; i.e., show that every point of 
(a, 6) is an interior point of (a, b) . 

b. Prove that any disk B r (c) is an open subset of C. Show also that the punctured diskB r (c) is 
an open set, where B r (c) = {z : < \z — c\ < r}, i.e., evrything in the disk B r (c) except the 
central point c. 

c. Prove that the neighborhood N r (S) of radius r around a set S is an open subset of C. 

d. Prove that no nonempty subset of R is an open subset of C. 

e. (e) Prove that the set Q of all rational numbers is not an open subset of R. We have seen in 
part (a) of Exercise 2.22 that Q is not a closed set. Consequently it is an example of a set 
that is neither open nor closed. Show that the set of all irrational numbers is neither open 
nor closed. 

We give next a useful application of the Bolzano- Weierstrass Theorem, or more precisely an application of 
Theorem 2.10, p. 45. This also provides some insight into the structure of open sets. 

Theorem 2.11: 

Let S be a closed and bounded subset of C, and suppose S is a subset of an open set U. Then 
there exists an r > such that the neighborhood N r (S) is contained in U. That is, every open set 
containing a closed and bounded set S actually contains a neighborhood of S. 
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Proof: 

If S is just a singleton {x}, then this theorem is asserting nothing more than the fact that x is in 
the interior of U, which it is if U is an open set. However, when S is an infinite set, then the result is 
more subtle. We argue by contradiction. Thus, suppose there is no such r > for which N r (S) C U. 
then for each positive integer n there must be a point x n that is not in U, and a corresponding 
point y n g S, such that \x n — y n \ < 1/n. Otherwise, the number r = 1/n would satisfy the claim 
of the theorem. Now, because the y n 's all belong to S, we know from Theorem 2.10, p. 45 that a 
subsequence {y nk } of the sequence {y n } must converge to a number y s S. Next, we see that 

\x„ k -y\< \x nk -y„ k \ + \y nk -y\,< — + \y nk - y\, (2.30) 

and this quantity tends to 0. Hence, the subsequence {x Uk } of the sequence {x n } also converges 
to y. 

Finally, because y belongs to S and hence to the open set U, we know that there must exist an 
e > such that the entire disk B e (y) C U. Then, since the subsequence {x nk } converges to y, there 
must exist an/, such that \x nk — y\ < e, implying that x nk € B e (y) , and hence belongs to U. But 
this is our contradiction, because all of the a;„'s were not in U. So, the theorem is proved. 

We give next a result that clarifies to some extent the connection between open sets and closed sets. 
Always remember that there are sets that are neither open nor closed, and just because a set is not open 
does not mean that it is closed. 

Theorem 2.12: 

A subset S of C (R) is open if and only if its complement S = C \ S (R \ S) is closed. 
Proof: 

First, assume that S is open, and let us show that S is closed. Suppose not. We will derive a 
contradiction. Suppose then that there is a sequence {x n } of elements of S that converges to a 
number x that is not in S; i.e., x is an element of S. Since every element of S is an interior point of 
S, there must exist an e > such that the entire disk B £ (x) (or interval (x — e, x + e)) is a subset 
of S. Now, since x = limx n , there must exist aniV such that \x n — x\ < e for every n > TV. In 
particular, \xn — x\ < e; i.e., xn belongs to B e (x) (or (x — e,x + ej). This implies that xn e S. 
But xn € S, and this is a contradiction. Hence, if S is open, then S is closed. 

Conversely, assume that S is closed, and let us show that S must be open. Again we argue by 
contradiction. Thus, assuming that S is not open, there must exist a point x € S that is not an 
interior point of S. Hence, for every e > the disk B e (x) (or interval (x — e, x + e)) is not entirely 
contained in S. So, for each positive integer n, there must exist a point x n such that \x n — x\ < 1/n 
and x n $_ S. It follows then that x = limx n , and that each x n € S. Since S is a closed set, we must 
have that x E S. But x G S, and we have arrived at the desired contradiction. Hence, if S is closed, 
then S is open, and the theorem is proved. 

The theorem below, the famous Heine-Borel Theorem, gives an equivalent and different description of 
closed and bounded sets. This description is in terms of open sets, whereas the original definitions were 
interms of limit points. Any time we can find two very different descriptions of the same phenomenon, we 
have found something useful. 

Definition 2.10: 

Let S be a subset of C (respectively R. By an open cover of S we mean a sequence {U n } of open 
subsets of C (respectively R) such that S C UC/ n ; i.e., for every x € Sphere exists an n such that 
x e U n . 

A subset S of C (respectively R) is called compact, or is said to satisfy the Heine-Borel property, 
if every open cover of S has a finite subcover. That is, if {U n } is an open cover of S, then there 
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exists an integer N such that S C U^^E/,,. In other words, only a finite number of the open sets 
are necessary to cover S. 

2.8: 

REMARK The definition we have given here for a set being compact is a little less general 
from the one found in books on topology. We have restricted the notion of an open cover to be a 
sequence of open sets, while in the general setting an open cover is just a collection of open sets. 
The distinction between a sequence of open sets and a collection of open sets is genuine in general 
topology, but it can be disregarded in the case of the topological spaces R and C. 

Theorem 2.13: Heine-Borel Theorem 

A subset S of C (respectively R) is compact if and only if it is a closed and bounded set. 
Proof: 

We prove this theorem for subsets S of C, and leave the proof for subsets of R to the exercises. 

Suppose first that S C C is compact, i.e., satisfies the Heine-Borel property. For each positive 
integer n, define U n to be the open set B n (0) . Then S C L)U n , because C = L)U n . Hence, by 
the Heine-Borel property, there must exist an N such that S C U^ =1 C/„. But then S C Bn (0) , 
implying that S is bounded. Indeed, \x\ < N for all x € S. 

Next, still assuming that S is compact, we will show that S is closed by showing that S is open. 
Thus, let x be an element of S. For each positive integer n, define U n to be the complement of the 
closed set B l / n (x). Then each U n is an open set by Theorem 2.12, and we claim that {U n } is an 
open cover of S. Indeed, if y e S, then y ^ x, and \y — x\ > 0. Choose an n so that 1/n < \y — x\. 
Then y £ B\i n (a;), implying that y e U n . This proves our claim that {U n } is an open cover of S. 
Now, by the Heine-Borel property, there exists an N such that S C \J™_-JJ n . But this implies that 
for every z € S we must have \z — x\ > 1/N, and this implies that the disk Bi//v (x) is entirely 
contained in S. Therefore, every element x of S is an interior point of S. So, S is open, whence S 
is closed. This finishes the proof that compact sets are necessarily closed and bounded. 

Conversely, assume that S is both closed and bounded. We must show that S satisfies the Heine- 
Borel property. Suppose not. Then, there exists an open cover {U n } that has no finite subcover. So, 
for each positive integer n there must exist an element i„6 5 for which x n ^ U£ =1 E/fc. Otherwise, 
there would be a finite subcover. By Theorem 2.10, p. 45, there exists a subsequence {x n . } of {x n } 
that converges to an element x of S. Now, because {U n } is an open cover of S, there must exist an 
N such that x € Un- Because Un is open, there exists an e > so that the entire disk B £ (x) is 
contained in Un- Since x = limx n ., there exists a J so that \x n . — x\ < e if j > J. Therefore, if 
j > J, then x n . g Un- But the sequence {rij} is strictly increasing, so that there exists a, j > J 
such that rif > N, and by the choice of the point x n ., , we know that x n ,, £ \J^ =1 Uk- We have 
arrived at a contradiction, and so the second half of the theorem is proved. 

Exercise 2.24 

a. Prove that the union A U B of two open sets is open and the intersection A n B is also open. 

b. Prove that the union AuB of two closed sets is closed and the intersection Af)B is also closed. 
HINT: Use Theorem 2.12, p. 46 and the set equations A U B = A n B, and AnB = Au B. 
These set equations are known as Demorgan's Laws. 

c. Prove that the union Au B of two bounded sets is bounded and the intersection An B is also 
bounded. 

d. Prove that the union A U B of two compact sets is compact and the intersection A n B is also 
compact. 

e. Prove that the intersection of a compact set and a closed set is compact. 

f. Suppose S is a compact set in C and r is a positive real number. Prove that the closed 
neighborhood N r (S) of radius r around S is compact. HINT: To see that this set is closed, 
show that its coplement is open. 
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2.8 Infinite Series 8 

Probably the most interesting and important examples of sequences are those that arise as the partial sums 
of an infinite series. In fact, it will be infinite series that allow us to explain such things as trigonometric 
and exponential functions. 

Definition 2.11: 

Let {a n }g° be a sequence of real or complex numbers. By the infinite seriesY a n we mean the 
sequence {Sn} defined by 

N 

S N = J2 a n- (2.31) 

The sequence {Sn} is called the sequence of partial sums of the infinite series Y a n, an d the 
infinite series is said to be summable to a number S, or to be convergent, if the sequence {Sn} of 
partial sums converges to S'.The sum of an infinite series is the limit of its partial sums. 

An infinite series Y a n is called absolutely summable or absolutely convergent if the infinite 
series Y \a n \ is convergent. 

If Y a n is n°t convergent, it is called divergent. If it is convergent but not absolutely convergent, 
it is called conditionally convergent. 

A few simple formulas relating the a„'s and the Sn's are useful: 

Sn = o-o + a\ + a 2 + ... + a N , (2.32) 

Sjv+i = Sn + ajv+i, (2.33) 

and 

M 
Sm — Sk = /_^ a n = a K+l + a K+2 + ■•■ + A M , (2.34) 

n=K+l 

for M > K. 

2.9: 

REMARK Determining whether or not a given infinite series converges is one of the most im- 
portant and subtle parts of analysis. Even the first few elementary theorems depend in deep ways 
on our previous development, particularly the Cauchy criterion. 

Theorem 2.14: 

Let {a n } be a sequence of nonnegative real numbers. Then the infinite series Y a n ls summable if 
and only if the sequence {Sn} of partial sums is bounded. 
Proof: 

If Y a n i s summable, then {Sn} is convergent, whence bounded according to Theorem 2.4, p. 
36. Conversely, we see from the hypothesis that each a n > that {Sn} is nondecreasing (Sn+i = 
Sn + &JV+1 > Sn)- So, if {Sn} is bounded, then it automatically converges by Theorem 2.1, p. 33, 
and hence the infinite series Y a n is summable. 

The next theorem is the first one most calculus students learn about infinite series. Unfortunately, it is 
often misinterpreted, so be careful! Both of the proofs to the next two theorems use Theorem 2.9, Cauchy 
Criterion, p. 43, which again is a serious and fundamental result about the real numbers. Therefore, these 
two theorems must be deep results themselves. 



B This content is available online at <http://cnx.Org/content/m36135/l.2/>. 
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Theorem 2.15: 

Let Y a n be a convergent infinite series. Then the sequence {a n } is convergent, and lima n = 0. 
Proof: 

Because Y a n is summable, the sequence {Sn} is convergent and so is a Cauchy sequence. There- 
fore, given an e > 0, there exists an No so that \S n — S m \ < e whenever both n and m > Nq. If 
n > Nq, let m = n — 1. We have then that \a n \ = \S n — S m \ < e, which completes the proof. 

2.10: 

REMARK Note that this theorem is not an "if and only if" theorem. The harmonic series (part 
(b) of Exercise 2.26 below) is the standard counterexample. The theorem above is mainly used to 
show that an infinite series is not summable. If we can prove that the sequence {a n } does not 
converge to 0, then the infinite series Y a n does not converge. The misinterpretation of this result 
referred to above is exactly in trying to apply the (false) converse of this theorem. 

Theorem 2.16: 

If Y a n is an absolutely convergent infinite series of complex numbers, then it is a convergent 
infinite series. (Absolute convergence implies convergence.) 
Proof: 

If {Sn} denotes the sequence of partial sums for Y a n, and if {Tn} denotes the sequence of partial 
sums for Y \ a n\, then 

M M 

\S M -S N \ = \ Y, a n\< Yl \ a n\ = \T M ~T N \ (2.35) 

n=N+l n=N+l 

for all N and M. We are given that {Tn} is convergent and hence it is a Cauchy sequence. So, by 
the inequality above, {Sn} must also be a Cauchy sequence. (If \T^ — 7m| < £, then |5jv — Sm\ < £ 
as well.) This implies that Y a n is convergent. 

Exercise 2.25: The Infinite Geometric Series 

Let z be a complex number, and define a sequence {a n } by a n = z n . Consider the infinite series 
Y a n- Show that Y^o a « converges to a number S if and only if \z\ < 1. Show in fact that 
S= 1/(1- z), when \z\ < 1. 

HINT: Evaluate explicitly the partial sums Sn, and then take their limit. Show that Sn = 

l-z N + 1 

1-2 ■ 

Exercise 2.26 

a. Show that Y^Li n (n+i) conver g e s to 1, by computing explicit formulas for the partial sums. 
HINT: Use a partial fraction decomposition for the o n 's. 

b. (The Harmonic Series.) Show that Y^Li V n diverges by verifying that S 2 k > k/2. HINT: 
Group the terms in the sum as follows, 

1 (\ 1\ (\ 1 1 1\ (\ 1 1 \ , 

and then estimate the sum of each group. Remember this example as an infinite series that 
diverges, despite the fact that is terms tend to 0. 

The next theorem is the most important one we have concerning infinite series of numbers. 

Theorem 2.17: Comparison Test 

Suppose {a n } and {&„} are two sequences of nonnegative real numbers for which there exists a 
positive integer M and a constant C such that 6„ < Ca n for all n > M. If the infinite series Y a n 
converges, so must the infinite series Y^n- 
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Proof: 

We will show that the sequence {Tjv} of partial sums of the infinite series J] 6„ is a bounded 
sequence. Then, by Theorem 2.14, p. 48, the infinite series Y b n must be summable. 

Write Sn for the ./Vth partial sum of the convergent infinite series Y a n- Because this series 
is summable, its sequence of partial sums is a bounded sequence. Let B be a number such that 
S N < B for all N. We have for all N > M that 

rclT N = J2n=l bn 

^ E„=i b n + Y n =M+i Ca n 



= E„=l b n + C Yn=M+l a n ( 2 - 37 ) 



M 



< E„=i b n + CS 



N 



.A/ 



< E„=i K + CB, 

which completes the proof, since this final quantity is a fixed constant. 

Exercise 2.27 

a. Let {a n } and {&„} be as in the preceding theorem. Show that if Y b n diverges, then J2 a n 
also must diverge. 

b. Show by example that the hypothesis that the a„'s and 6 n 's of the Comparison Test are 
nonnegative can not be dropped. 

Exercise 2.28: The Ratio Test 

Let {a n } be a sequence of positive numbers. 

a. If lim supa n+ i/a n < 1, show that Y a n converges. HINT: If lim supa n +i/a n = a < 1, let 
(3 be a number for which a < /3 < 1. Using part (a) of Exercise 2.17, show that there exists 
an N such that for all n > TV we must have a n+ \/a n < (3, or equivalently a n+ \ < f3a n , and 
therefore a,N+k < /3 k a,N- Now use the comparison test with the geometric series J20 k - 

b. If lim infa n +i/a n > 1, show that Y, a n diverges. 

c. As special cases of parts (a) and (b), show that {a n } converges if lim n a n+ i/a n < 1, and 
diverges if lim n a n+ i/a n > 1. 

d. Find two examples of infinite series' J2 a n of positive numbers, such that lima n +i/a n = 1 for 
both examples, and such that one infinite series converges and the other diverges. 

Exercise 2.29 

a. Derive the Root Test: If {a n } is a sequence of positive numbers for which lim supa n < 1, 
then Y a n converges. And, if lim infa n > 1, then ^ a n diverges. 

b. Let r be a positive integer. Show that Yl ^-/n r converges if and only if r > 2. HINT: Use 
Exercise 2.26 and the Comparison Test for r = 2. 

c. Show that the following infinite series are summable. 

£l/(n 2 + l), 5>/ 2 "' $>"M ( 2 - 38 ) 

for a any complex number. 



51 



Exercise 2.30 

Let {a n } and {&„} be sequences of complex numbers, and let {Sn} denote the sequence of partial 
sums of the infinite series Yl a n- Derive the Abel Summation Formula: 

N N-l 

^2 a n b n = S N b N +^2 S n (b n - b n+1 ) . (2.39) 

n— 1 n—1 

The Comparison Test is the most powerful theorem we have about infinite series of positive terms. Of course, 
most series do not consist entirely of positive terms, so that the Comparison Test is not enough. The next 
theorem is therefore of much importance. 

Theorem 2.18: Alternating Series Test 

Suppose {ai,a2,<i3, ■•■} is an alternating sequence of real numbers; i.e., their signs alternate. As- 
sume further that the sequence {|a n |} is nonincreasing with = lim\a n \. Then the infinite series 
Y a n converges. 
Proof: 

Assume, without loss of generality, that the odd terms <22n+i of the sequence {a n } are positive and 
the even terms ai n are negative. We collect some facts about the partial sums Sn = ai+a,2 + --- + ciN 
of the infinite series Y a n- 

1. Every even partial sum S2N is less than the following odd partial sum S2N+1 = S2N + a2N+i, 
And every odd partial sum S2N+1 is greater than the following even partial sum S2N+2 = 

S2N+I + 0-2N+2- 

2. Every even partial sum S2N is less than or equal to the next even partial sum S2N+2 = 
S2N J ra-2N+i J r&2N+2) implying that the sequence of even partial sums {S*2Ar} is nondecreasing. 

3. Every odd partial sum S2N+1 is greater than or equal to the next odd partial sum S2N+3 = 
S2N+1 + &2N+2 + fl2Af+3, implying that the sequence of odd partial sums {S2N+1} is nonin- 
creasing. 

4. Every odd partial sum S2N+1 is bounded below by S2- For, S2N+1 > -SW > ^2- And, every 
even partial sum S2N is bounded above by Si. For, S2N < S2N+1 5= Si. 

5. Therefore, the sequence {S2N} of even partial sums is nondecreasing and bounded above. That 
sequence must then have a limit, which we denote by S e . Similarly, the sequence {S^iv+i} 
of odd partial sums is nonincreasing and bounded below. This sequence of partial sums also 
must have a limit, which we denote by S . 

Now 

S — S e = UmS2N+i — HmS2N = lifn (SW+i — -SW) = Uma2N+i = 0, (2.40) 

showing that S e = S , and we denote this common limit by S. Finally, given an e > 0, there 
exists an Ni so that \S2N — S\ < e if 2N > Ni, and there exists an N2 so that jSW+i — S\ < e if 
2N + 1 > N2. Therefore, if N > max (Ni,N2) , then \Sn — S\ < e, and this proves that the infinite 
series converges. 

Exercise 2.31: The Alternating Harmonic Series 

a. Show that J^^Li ( — l)"/ n converges, but that it is not absolutely convergent. 

b. Let {a n } be an alternating series, as in the preceding theorem. Show that the sum S = Yl a n 
is trapped between Sn and Sjv+i, an d that \S — Sn\ < \o-n\- 

c. State and prove a theorem about "eventually alternating infinite series." 

d. Show that Y z n /n converges if and only if \z\ < 1, and z ^ 1. HINT: Use the Abel Summation 
Formula to evaluate the partial sums. 
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Exercise 2.32 

Let s = p/q be a positive rational number. 

a. For each x > 0, show that there exists a unique y > such that y s = x; i.e., y p = x q . 

b. Prove that ^l/n s converges if s > 1 and diverges if s < 1. HINT: Group the terms as in 
part (b) of Exercise 2.26. 

Theorem 2.19: Test for Irrationality 

Let a; be a real number, and suppose that {pn/^n} is a sequence of rational numbers for which 
x = UmpN /qN and x ^ Pn/c/n for any N. If limqiq\x — Pn/qn\ = 0, then x is irrational. 
Proof: 

We prove the contrapositive statement; i.e., if x = p/q is a rational number, then UmqN\% — 
Pn/qn\ ¥= 0- We have 

/ / / pqN-qpN /n .-., 

x-pN/qN = P/q-PN/qN = • ( 2 -4i) 

qqN 

Now the numerator pq^ — qpjq is not for any N. For, if it were, then x = p/q = Pn /qN, which 
we have assumed not to be the case. Therefore, since pq^ — qpx is an integer, we have that 

\x-Pn qN\ = \ > ■; 1- (2.42) 

qqN \qqN\ 



So, 



q N \x-p N /qN\ > 7-j, (2-43) 

\q\ 



and this clearly does not converge to 0. 



Exercise 2.33 



a. Let x = J2^Lo (— 1)™/2™. Prove that x is a rational number. 

b. Let y = J]^Lo ( — 1)™/2™ • Prove that y is an irrational number. HINT: The partial sums of 
this series are rational numbers. Now use the preceding theorem and part (b) of Exercise 2.31 
(The Alternating Harmonic Series). 



Chapter 3 

Functions and Continuity 



3.1 Functions and Continuity Definition of the Number it 1 

The concept of a function is perhaps the most basic one in mathematical analysis. The objects of interest 
in our subject can often be represented as functions, and the " unknowns" in our equations are frequently 
functions. Therefore, we will spend some time developing and understanding various kinds of functions, 
including functions defined by polynomials, by power series, and as limits of other functions. In particular, we 
introduce in this chapter the elementary transcendental functions. We begin with the familiar set theoretical 
notion of a function, and then move quickly to their analytical properties, specifically that of continuity. 
The main theorems of this chapter include: 

1. The Intermediate Value Theorem (Theorem 3.6, Intermediate Value Theorem, p. 64), 

2. the theorem that asserts that a continuous real-valued function on a compact set attains a 
maximum and minimum value (Theorem 3.8, p. 65), 

3. A continuous function on a compact set is uniformly continuous (), 

4. The Identity Theorem for Power Series Functions (Theorem 3.14, Identity Theorem, p. 71), 

5. The definition of the real number n, 

6. The theorem that asserts that the uniform limit of a sequence of continuous functions is 
continuous (Theorem 3.18, The uniform limit of continuous functions is continuous., p. 76), and 

7. the Weierstrass M-Test (Theorem 3.19, Weierstrass M-Test, p. 77). 



3.2 Functions 2 

Definition 3.1: 

Let S and T be sets. A function from S into T (notation / : S — » T) is a rule that assigns to each 
element x in S a unique element denoted by / (x) in T. 

It is useful to think of a function as a mechanism or black box. We use the elements of S as 
inputs to the function, and the outputs are elements of the set T. 

If / : S — > T is a function, then S is called the domain of /, and the set T is called the codomain 
of /. The range or image of / is the set of all elements y in the codomain T for which there exists 
an x in the domain S such that y = f (x) . We denote the range by / (S) . The codomain is the set 
of all potential outputs, while the range is the set of actual outputs. 

Suppose / is a function from a set S into a set T. If A C S, we write / (A) for the subset of T 
containing all the elements t € T for which there exists an s € A such that t = f (s) . We call / (A) 



1 This content is available online at <http://cnx.Org/content/m36131/l.2/>. 
2 This content is available online at <http://cnx.Org/content/m36141/l.2/>. 
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the image of A under /. Similarly, if B C T, we write / _1 (B) for the subset of S containing all 
the elements s G S such that / (s) s £>, and we call the set f~ l (B) the inverse image or preimage 
of £>. The symbol / _1 (£>) is a little confusing, since it could be misinterpreted as the image of the 
set B under a function called / _1 . We will discuss inverse functions later on, but this notation is 
not meant to imply that the function / has an inverse. 

If / : S —> T, then the graph of / is the subset G of the Cartesian product S x T consisting of 
all the pairs of the form (x, f (x)) . 

If / : S — > R is a function, then we call / a real-valued function, and if / : S — * C, then 
we call / a complex-valued function. If / : S — > C is a complex-valued function, then for each 
x £ S the complex number / (x) can be written as u (x) + iv (x) , where u (x) and v (x) are the real 
and imaginary parts of the complex number / (x) . The two real- valued functions u : S — » R and 
v : S — > R are called respectively the real and imaginary parts of the complex- valued function /. 

If / : S — » T and SCR, then / is called a function of a real variable, and if S C C, then / is 
called a function of a complex variable. 

If the range of / equals the codomain, then / is called onto. 

The function / : S — > T is called one-to-one if / (2:1) = / (#2) implies that x\ = X2- 

The domain of / is the set of x's for which / (a;) is defined. If we are given a function / : S — > T, we 
are free to regard / as having a smaller domain, i.e., a subset S' of S. Although this restricted function is 
in reality a different function, we usually continue to call it by the same name /. Enlarging the domain of a 
function, in some consistent manner, is often impossible, but is nevertheless frequently of great importance. 
The codomain of / is distinguished from the range of f, which is frequently a proper subset of the codomain. 
For example, since every real number is a complex number, any real-valued function / : S — > R is also a 
(special kind of) complex-valued function. 

We consider in this book functions either of a real variable or of complex variable, that is, the domains 
of functions here will be subsets either of R or of C. Frequently, we will indicate what kind of variable we are 
thinking of by denoting real variables with the letter x and complex variables with the letter z. Be careful 
about this, for this distinction is not always made. 

Many functions, though not all by any means, are defined by a single equation: 

y = 3x - 7, (3.1) 

y=(x 2 + x+l) 2/ \ (3.2) 

x 2 + y 2 = 4, (3.3) 

(How does this last equation define a function?) 

(l-sV^/^/Cl-y)) 8 / 17 . (3.4) 

(How does this equation determine a function?) 

There are various types of functions, and they can be combined in a variety of ways to produce other 
functions. It is necessary therefore to spend a fair amount of time at the beginning of this chapter to present 
these definitions. 

Definition 3.2: 

If / and g are two complex- valued functions with the same domain S, i.e., / : S — > C and g : S — » C, 
and if c is a complex number, we define f + g, fg, f/g (if g (x) is never 0), and cf by the familiar 
formulas: 

(f + g)(x) = f(x)+g(x), (3.5) 

(fg)(x) = f(x)g(x), (3.6) 
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(f/g)(x) = f(x)/g(x), (3.7) 

and 

(cf) (x) = cf (x) . (3.8) 

If / and g are real-valued functions, we define functions max (/, g) and min (/, g) by 

[max (/, g)] (x) = max (/ (x) , 5 (x)) (3.9) 

(the maximum of the numbers / (x) and g (x)), and 

[min (/, g)] (x) = min (/ (x) , 3 (x)) , (3.10) 

(the minimum of the two numbers / (x) and g (x)). 

If / is either a real- valued or a complex- valued function on a domain S, then we say that / is 
bounded if there exists a positive number M such that \f (x) | < M for all x e S. 

There are two special types of functions of a real or complex variable, the even functions and the odd 
functions. In fact, every function that is defined on all of R or C (or, more generally, any function whose 
domain S equals — S) can be written uniquely as a sum of an even part and an odd part. This decomposition 
of a general function into simpler parts is frequently helpful. 

Definition 3.3: 

A function / whose domain S equals —S, is called an even function if f(—z) = f (z) for all z in 
its domain. It is called an odd function if / (—z) = — f (z) for all z in its domain. 

We next give the definition for perhaps the most familiar kinds of functions. 

Definition 3.4: 

A nonzero polynomial or polynomial function is a complex- valued function of a complex variable, 
p : C — > C, that is defined by a formula of the form 

n 

p (z) = 2^ a k z = a + a l z + a 2 z2 + ••• + 0"nZ n , (3-11) 

k=0 

where the a^'s are complex numbers and a n ^ 0. The integer n is called the degree of the polynomial 
p and is denoted by deg (p) . The numbers ao,a\, ..., a n are called the coefficients of the polynomial. 
The domain of a polynomial function is all of C; i.e., p(z) is defined for every complex number z. 

For technical reasons of consistency, the identically function is called the zero polynomial. All 
of its coefficients are and its degree is defined to be — oo. 

A rational function is a function r that is given by an equation of the form r (z) = p (z) /q (z) , 
where q is a nonzero polynomial and p is a (possibly zero) polynomial. The domain of a rational 
function is the set S of all z G C for which q (z) / 0, i.e., for which r (z) is defined. 

Two other kinds of functions that are simple and important are step functions and polygonal functions. 

Definition 3.5: 

Let [a, b] be a closed bounded interval of real numbers. By a partition of [a, b] we mean a finite set 
P = {xq < xi < ... < x„} of n + 1 points, where xq = a and x n = b. 

The n intervals {[x,_i,Xj]}, for 1 < i < n, are called the closed subintervals of the partition P, 
and the n intervals {(xj_i, xi)} are called the open subintervals of P. 

We write || P || for the maximum of the numbers (lengths of the subintervals) {xi — Xj_i}, and 
call the number || P || the mesh size of the partition P. 

A function h : [a, b] — * C is called a step function if there exists a partition P = {xo < x\ < 
... < x n } of [a, b] and n numbers {ai,a2, ...,a„} such that h(x) = ai if Xj_i < x < Xj. That is, h is 
a step function if it is a constant function on each of the (open) subintervals (xi-i,Xi) determined 
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by a partition P. Note that the values of a step function at the points {xi} of the partition are not 
restricted in any way. 

A function I : [a, b] — * R is called a polygonal function, or a piecewise linear function, if there 
exists a partition P = {xq < x\ < ... < x n } of [a, b] and n+ 1 numbers {yo,yi, ...,y„} such that for 
each x € [:Ej_i,:rj] ,1 (x) is given by the linear equation 

l(x) = Vi-i + rrn (x - Xi-i) , (3-12) 

where m, = (y, — J/»i) / (%i — Xi-i) ■ That is, I is a polygonal function if it is a linear function on 
each of the closed subintervals [xi_i,Xi\ determined by a partition P. Note that the values of a 
piecewise linear function at the points {x^ of the partition P are the same, whether we think of 
Xi in the interval [:Ej_i, Xj\ or [xi, Xi+{\ . (Check the two formulas for I [xi) .) 

The graph of a piecewise linear function is the polygonal line joining the n + 1 points {(a;,, yi)}- 
There is a natural generalization of the notion of a step function that works for any domain S, 
e.g., a rectangle in the plane C. Thus, if S is a set, we define a partition of S to be a finite collection 
{.Ei, E2, ..., E n } of subsets of S for which 

1. U™ =1 £ 4 = S, and 

2. Ei n Ej = if % =£ j. 

Then, a step function on S would be a function h that is constant on each subset Ei. We will 
encounter an even more elaborate generalized notion of a step function in Chapter V, but for now 
we will restrict our attention to step functions defined on intervals [a, b] . 

The set of polynomials and the set of step functions are both closed under addition and multi- 
plication, and the set of rational functions is closed under addition, multiplication, and division. 

Exercise 3.1 

a. Prove that the sum and product of two polynomials is again a polynomial. Show that 
deg (p + q) < max (deg (p) , deg (q)) and deg (jpq) = deg (p) + deg (q) . Show that a constant 
function is a polynomial, and that the degree of a nonzero constant function is 0. 

b. Show that the set of step functions is closed under addition and multiplication. Show also 
that the maximum and minimum of two step functions is again a step function. (Be careful 
to note that different step functions may be determined by different partitions. For instance, 
a partition determining the sum of two step functions may be different from the partitions 
determining the two individual step functions.) Note, in fact, that a step function can be 
determined by infinitely many different partitions. Prove that the sum, the maximum, and 
the minimum of two piecewise linear functions is again a piecewise linear function. Show by 
example that the product of two piecewise linear functions need not be piecewise linear. 

c. Prove that the sum, product, and quotient of two rational functions is again a rational func- 
tion. 

d. Prove the Root Theorem: If p (z) = J]fc=o a k zk ls a nonzero polynomial of degree n, 
and if c is a complex number for which p (c) = 0, then there exists a nonzero polynomial 
Q ( z ) = S?=o ^i z ^ °f degree n — 1 such that p (z) = (z — c) q (z) for all z. That is, if c is a 
"root" of p, then z — c is a factor of p. Show also that the leading coefficient 6 n _i of q equals 
the leading coefficient a n of p. HINT: Write 

n 

p(z)=p (z) -p{c) = Y J *k (z k - c k ) = .... (3.13) 

fe=0 

e. Let / be a function whose domain S equals — S. Define functions f e and f by the formulas 

f(z) + f(-z) /( z )-/(- Z ) 
Je \Z) = 7, and Jo (Zj = " • (3.14) 
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Show that f e is an even function, that f is an odd function, and that / = f e + f . Show 
also that, if / = g + h, where g is an even function and h is an odd function, then g = f e and 
h = f . That is, there is only one way to write / as the sum of an even function and an odd 
function. 

f. Use part (e) to show that a polynomial p is an even function if and only if its only nonzero 
coefficients are even ones, i.e., the o^fc's. Show also that a polynomial is an odd function if 
and only if its only nonzero coefficients are odd ones, i.e., the a,2k+i' s - 

g. Suppose p{z) = J]fc = o a 2fc 2;2fc ls a polynomial that is an even function. Show that 

n 

p {iz) = ]T {-l) k a 2k z 2k = p a (z) , (3.15) 

fe=0 

where p a is the polynomial obtained from p by alternating the signs of its nonzero coefficients, 
h. If q (z) = X^fc=o a 2k+\z 2k+l is a polynomial that is an odd function, show that 

n 
q {iz) = i J2 (-l) k d2k + iz 2k+1 = iq a (z) , (3.16) 

k=a 

where again q a is the polynomial obtained from q by alternating the signs of its nonzero 
coefficients, 
i. If p is any polynomial, show that 

p {iz) = p e {iz) + Po {iz) = p a e (z) + iPo (z) , (3.17) 

and hence that p e {iz) = p° (z) and p {iz) = ip a Q {z) . 



3.3 Polynomial Functions 3 

Ifp(z) = J]fc = o a kZ k and q {z) = J2T= bjzi are two polynomials, it certainly seems clear that they determine 
the same function only if they have identical coefficients. This is true, but by no means an obvious fact. 
Also, it seems clear that, as \z\ gets larger and larger, a polynomial function is more and more comparable to 
its leading term a n z n . We collect in the next theorem some elementary properties of polynomial functions, 
and in particular we verify the above "uniqueness of coefficients" result and the "behavior at infinity" result. 

Theorem 3.1: 

1. Suppose p (z) = J]fe=o a kZ k is a nonconstant polynomial of degree n > 0. Then p{z) = for 
at most n distinct complex numbers. 

2. If r is a polynomial for which r {z) = for an infinite number of distinct points, then r is the 
zero polynomial. That is, all of its coefficients are 0. 

3. Suppose p and q are nonzero polynomials, and assume that p{z) = q {z) for an infinite number 
of distinct points. Then p{z) = q {z) for all z, and p and q have the same coefficients. That 
is, they are the same polynomial. 

4. Let p{z) = Yll^o ] 2 '' be a polynomial of degree n > 0. Then there exist positive constants 
m and B such that 

l -^\z\ n <\ P {z)\<M\z\ n (3.18) 

for all complex numbers z for which \z\ > B. That is, For all complex numbers z with \z\ > B, 
the numbers \p{z) | and \z\ n are "comparable." 



3 This content is available online at <http://cnx.Org/content/m36147/l.2/>. 
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5. If / : [0, oo ) — > C is defined by / (ar) = \fx, then there is no polynomial p for which / (aj) = 
p(x) for all x > 0. That is, the square root function does not agree with any polynomial 
function. 

Proof: 

We prove part (1) using an argument by contradiction. Thus, suppose there does exist a counterex- 
ample to the claim, i.e., a nonzero polynomial p of degree n and n+l distinct points {ci, C2, •••, c n+ \} 
for which p (cj) = for all 1 < j < n + 1. From the set of all such counterexamples, let po be one 
with minimum degree no- That is, the claim in part (1) is true for any polynomial whose degree is 
smaller than uq. We write 

p (z)=J^a k z k , (3.19) 

k=0 

and we suppose that po (cj) = for j = 1 to no + 1, where these Cfc's are distinct complex numbers. 
We use next the Root Theorem (part (d) of Exercise 3.1) to write po (z) = (z — c no+ i) q (z) , where 
9 i z ) = X^felo bkz k . We have that q is a polynomial of degree no — 1 and the leading coefficient a no 
of po equals the leading coefficient 6 no _i of q. Note that for 1 < j < no we have 

= Po (cj) = {cj - c„ 0+ i) q (cj) , (3.20) 

which implies that q(cj) = for 1 < j < no, since Cj — c no +i ^ 0. But, since deg (q) < no, the 
nonzero polynomial q can not be a counterexample to part (1), implying that q (z) = for at most 
no — 1 distinct points. We have arrived at a contradiction, and part (1) is proved. 

Next, let r be a polynomial for which r (z) = for an infinite number of distinct points. It 
follows from part (1) that r cannot be a nonzero polynomial, for in that case it would have a degree 
n > and could be for at most n distinct points. Hence, r is the zero polynomial, and part (2) 
is proved. 

Now, to see part (3), set r = p— q. Then r is a polynomial for which r (z) = for infinitely many 
z's. By part (2), it follows then that r (z) =0 for all z, whence p(z) = q(z) for all z. Moreover, 
p — q is the zero polynomial, all of whose coefficients are 0, and this implies that the coefficients for 
p and q are identical. 

To prove the first inequality in part (4), suppose that \z\ > 1, and from the backwards triangle 
inequality, note that 

|p(*)l = l£Lo<** fc l 

M"IELo^l 



= \z\ n \(Z n k Zo^)+c n 

> M"(M-I£rd^l) (3.2i) 



> 
> 



M n {\Cn\-EV J^ 

r(M- jr\T,tZo\ck\ 



> \z 

Set B equal to the constant (2/|c„|) Yl^Zo \ c j\- Then, replacing the \/\z\ in the preceding calculation 
by 1/B, we obtain 

\p (z) \ > m\z\ n (3.22) 

for every z for which \z\ > B. This proves the first half of part (4). 
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To get the other half of part (4), suppose again that \z\ > 1. We have 

n n 

M^I<£k|M fc < ^klM", (3.23) 

fe=0 k=o 

so that we get the other half of part (4) by setting M = J]fe=o l c *l- 

Finally, to see part (5), suppose that there does exist a polynomial p of degree n such that 
y/x = p (x) for all x > 0. Then x = (p (x)) for all x > 0. Now p 2 is a polynomial of degree In. By 
part (2), the two polynomials q (x) = x and (p (xj) must be the same, implying that they have the 
same degree. However, the degree of q is 1, which is odd, and the degree of p 2 is 2n, which is even. 
Hence, we have arrived at a contradiction. 

Exercise 3.2 

a. Let r (z) = p(z) /q(z) and r (z) = p (z) /q (z) be two rational functions. Suppose r(z) = 
r (z) for infinitely many z'a. Prove that r (z) = r (z) for all z in the intersection of their 
domains. Is it true that p = p and q = q'l 

b. Let p and q be polynomials of degree n and m respectively, and define a rational function r 
by r = p/q. Prove that there exist positive constants C and B such that \r (z) | < C\z\ n ~ m 
for all complex numbers z for which \z\ > B. 

c. Define / : [0, oo) — » R by / (x) = y/x. Show that there is no rational function r such that 
/ (x) = r (x) for all x > 0. That is, the square root function does not agree with a rational 
function. 

d. Define the real- valued function r on R by r (x) = 1/ (l + x 2 ) . Prove that there is no polyno- 
mial p such that p (x) = r (x) for infinitely many real numbers x. 

e. If / is the real-valued function of a real variable given by / (x) = \x\, show that / is not 
a rational function. HINT: Suppose \x\ = p(x) /q(x). Then |a;|g(x) = p(x) implying that 
\x\q (x) is a polynomial s (x) . Now use Theorem 3.1 to conclude that p (x) = xq (x) for all x 
and that p (x) = —xq (x) for all x. 

f. Let / be any complex-valued function of a complex variable, and let c\, ...,c n be n distinct 
complex numbers that belong to the domain of /. Show that there does exist a polynomial p 
of degree n such that p (cj) = f (cj) for all 1 < j < n. HINT: Describe p in factored form. 

g. Give examples to show that the maximum and minimum of two polynomials need not be a 
polynomial or even a rational function. 

Very important is the definition of the compositiong o / of two functions / and g. 

Definition 3.6: 

Let / : S — > T and g : T — > U be functions. We define a function g o /, with domain S and 
codomain U, by (g o /) (x) = g (f (x)) . 

If / : S — > T,g : T — » S, and g o / (x) = x for all x € 5, then g is called a left inverse of /. If 
f °9 (y) = V f° r a ll y € T, then g is called a right inverse for /. If g is both a left inverse and a right 
inverse, then g is called an inverse for /,/ is called invertible, and we denote g by / _1 . 

Exercise 3.3 

a. Suppose / : S — > T has a left inverse. Prove that / is 1-1. 

b. Suppose / : S — > T has a right inverse. Prove that / is onto. 

c. Show that the composition of two polynomials is a polynomial and that the composition of 
two rational functions is a rational function. HINT: If p is a polynomial, show by induction 
that p n is a polynomial. Now use Exercise 3.1. 

d. Find formulas for gof and fog for the following. What are the domains of these compositions? 
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f{x) = l + x 2 and g (x) = 1/(1 + x) 1/2 . 
f (x) = xj (x + 1) and g (x) = x/ (1 — x) 
f (x) = ax + b and g (x) = ex + d. 



3.4 Continuity 4 

Next, we come to the definition of continuity. Unlike the preceding discussion, which can be viewed as being 
related primarily to the algebraic properties of functions, this one is an analytic notion. 

Definition 3.7: 

Let S and T be sets of complex numbers, and let / : S — > T. Then / is said to be continuous at a 
pointc of S if for every positive e, there exists a positive S such that if x € S satisfies \x — c\ < S, 
then |/ (x) — / (c) | < e. The function / is called continuous on S if it is continuous at every point 
coiS. 

If the domain S of / consists of real numbers, then the function / is called right continuous at c 
if for every e > there exists a 5 > such that \f (x) — / (c) | < e whenever x G S and < x — c < S, 
and is called left continuous at c if for every e > there exists a 5 > such that \f (x) — f (c) | < e 
whenever x G S and > x — c > — <5. 

3.1: 

REMARK If / is continuous at a point c, then the positive number S of the preceding definition 
is not unique (any smaller number would work as well), but it does depend both on the number e 
and on the point c. Sometimes we will write 5 (e, c) to make this dependence explicit. Later, we 
will introduce a notion of uniform continuity in which S only depends on the number e and not on 
the particular point c. 

The next theorem indicates the interaction between the algebraic properties of functions and continuity. 

Theorem 3.2: 

Let S and T be subsets of C, let / and g be functions from S into T, and suppose that / and g 
are both continuous at a point c of S. Then 

1. There exists a 5 > and a positive number M such that if \y — c\ < 5 and y e S then 
1/ (y) | < M. That is, if / is continuous at c, then it is bounded near c. 

2. / + g is continuous at c. 

3. /(/ is continuous at c. 

4. |/| is continuous at c. 

5. If g (c) / 0, then f/g is continuous at c. 

6. If / is a complex-valued function, and u and v are the real and imaginary parts of /, then / 
is continuous at c if and only if u and v are continuous at c. 

Proof: 

We prove parts (1) and (5), and leave the remaining parts to the exercise that follows. 

To see part (1), let e = 1. Then, since / is continuous at c, there exists a 6 > such that if 
\y — c\ < 5 and y s S then \f (y) — / (c) | < 1. Since |z — w\ > \\z\ — \w\\ for any two complex 
numbers z and w (backwards Triangle Inequality), it then follows that ||/ (y) \ — \f (c) || < 1, from 
which it follows that if \y - c\ < S then \f (y) | < |/ (c) | + 1. Hence, setting M = |/ (c) | + 1, we 
have that if \y — c\ < 6 and y e S, then \f (y) \ < M as desired. 

To prove part (5), we first make use of part 1. Let S\,M\ and 62, Mi be chosen so that if 
\y — c\ < Si and y £ S then 

|/(y)|<Mi (3.24) 



4 This content is available online at <http://cnx.Org/content/m36150/l.2/>. 
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and if \y — c\ < <5 2 and y s S then 

\g (y) I < M 2 (3.25) 

Next, let e be the positive number \g (c) |/2. Then, there exists a S' > such that if |y — c\ < 6' and 
y & S then |g (y) — g (c) | < e' = \g (c) |/2. It then follows from the backwards triangle inequality 
that 

\g (y) I > e = \g (c) |/2 so that |l/<? (y) | < 2/\g (c) | (3.26) 

Now, to finish the proof of part (5), let £ > be given. If \y — c\ < min (Si, <5 2 , S) and y g 5, then 
from Inequalities (3.1), (3.2), and (3.3) we obtain 

I f{y) /(c) I = l/fa)g(c)-/(c)g(y)l 

l/fa)g(c)-/(c)g(c)+/(c) 3 (c)-/(c)g( l ;)| 
l9(y)llff(c)l 

<r l/fa)~/(c)l|g(c)|+|/(c)||g(c)- g fa)| 

- \g{v)\\g{c)\ 

< (|/ (y) - / (c) |M 2 + Milg (c) - S (y) |) x ^. 

Finally, using the continuity of both / and g applied to the positive numbers ei = s/ ( 4M 2 \g (c) | ) 

and e 2 = e/ (4Mi|y (c) | ) , choose 5 > 0, with <5 < rain (5\, 82, S) , and such that if \y — c\ < 5 and 
y e S then \f (y) - / (c) | < 4M2/ , g(c)| 2 and | 5 (c) - y (y) | < 4Mi/ [ g(c) |2 - Then, if \y - c| < <5 and 



(3.27) 



y s 5 we have that 



1^1 -^|l< £ (3-28) 

9(2/) 5(c) 



as desired. 

Exercise 3.4 

a. Prove part (2) of the preceding theorem. (It's an e/2 argument.) 

b. Prove part (3) of the preceding theorem. (It's similar to the proof of part (5) only easier.) 

c. Prove part (4) of the preceding theorem. 

d. Prove part (6) of the preceding theorem. 

e. Suppose S is a subset of R. Verify the above theorem replacing " continuity" with left conti- 
nuity and right continuity. 

f. If S is a subset of R, show that / is continuous at a point c s S if and only if it is both right 
continuous and left continuous at c. 

Theorem 3.3: The composition of continuous functions is continuous. 

Let S, T, and U be subsets of C, and let / : S — > T and g : T — > U be functions. Suppose / is 
continuous at a point c G S and that g is continuous at the point / (c) € T. Then the composition 
g o / is continuous at c. 
Proof: 

Let e > be given. Because g is continuous at the point / (c) , there exists an a > such that 
\g (t) — g (/ (c)) I < £ if |i — / (c) I < a. Now, using this positive number a, and using the fact that / 
is continuous at the point c, there exists a 6 > so that \f (s) — f (c) | < a if \s — c\ < S. Therefore, 
if \s-c\< 5, then \f (s) - f (c) | < a, and hence \g (f (s)) - g (/ (c)) | = \g o / ( s ) - g o f (c) | < e, 
which completes the proof. 

Exercise 3.5 
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a. If / : C — > C is the function defined by / (z) = z, prove that / is continuous at each point of 
C. 

b. Use part (a) and Theorem 3.2 to conclude that every rational function is continuous on its 
domain. 

c. Prove that a step function h : [a, 6] — > C is continuous everywhere on [a, b] except possibly at 
the points of the partition P that determines h. 

Exercise 3.6 

a. Let S be the set of nonnegative real numbers, and define / : S — > S by / (x) = ^fx. Prove 
that / is continuous at each point of S. HINT: For c = 0, use 5 = e 2 . For c / 0, use the 
identity 

/- r I r- r \^y + ^ y-c y-c 

^-^ =( ^-^V^^ = V^7^-^- (3 - 29) 

b. If / : C — » R is the function defined by / (z) = \z\, show that / is continuous at every point 
of its domain. 

Exercise 3.7 

Using the previous theorems and exercises, explain why the following functions / are continuous 
on their domains. Describe the domains as well. 

a. f(z) = {l-z 2 )/(l + z 2 ). 

b. f( z ) = \l + z + z 2 + z 3 -{l/z)\. 



c f(z) 



Jl + yJl-W* 



Exercise 3.8 

a. If c and d are real numbers, show that max (c, d) = (c + d) Jl + \c — d\/2. 

b. If / and g are functions from S into R, show that max (f,g) = (f + g) /2 + \f — g\/2. 

c. If / and g are real- valued functions that are both continuous at a point c, show that max (/, g) 
and min (/, g) are both continuous at c. 

Exercise 3.9 

Let TV be the set of natural numbers, let P be the set of positive real numbers, and define / : N — > P 
by / (ri) = \/l + n. Prove that / is continuous at each point of N. Show in fact that every function 
/ : N — » C is continuous on this domain TV. 

HINT: Show that for any e > 0, the choice of 5 = 1 will work. 

Exercise 3.10: Negations 

a. Negate the statement: "For every e > 0,\x\ < e." 

b. Negate the statement: "For every e > 0, there exists an x for which \x\ < e." 

c. Negate the statement that " / is continuous at c." 

The next result establishes an equivalence between the basic e, 5 definition of continuity and a sequential 
formulation. In many cases, maybe most, this sequential version of continuity is easier to work with than 
the £, d version. 

Theorem 3.4: 

Let / : S — » C be a complex-valued function on S, and let c be a point in S. Then / is continuous 
at c if and only if the following condition holds: For every sequence {x n } of elements of S that 
converges to c, the sequence {/ (x n )} converges to / (c) . Or, said a different way, if {x n } converges 
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to c, then {/ (x n )} converges to /(c). And, said yet a third (somewhat less precise) way, the 
function / converts convergent sequences to convergent sequences. 
Proof: 

Suppose first that / is continuous at c, and let {x n } be a sequence of elements of S that converges to 
c. Let e > be given. We must find a natural number N such that if n > TV then \f (x n ) — / (c) | < e. 
First, choose 5 > so that \f (y) — / (c)\ < e whenever y e S and \y — c\ < S. Now, choose N so that 
\x n — c\ < 5 whenever n > N. Then if n > N, we have that \x n — c\ < S, whence \f (x n ) — f (c) | < e. 
This shows that the sequence {/ (x n )} converges to / (c) , as desired. 

We prove the converse by proving the contrapositive statement; i.e., we will show that if / is not 
continuous at c, then there does exist a sequence {x n } that converges to c but for which the sequence 
{/ i x n)} does not converge to / (c) . Thus, suppose / is not continuous at c. Then there exists an 
£o > such that for every 5 > there is a y e S such that \y — c\ < S but \f (y) — / (c) | > £o- 
To obtain a sequence, we apply this statement to <5's of the form 5 = 1/n. Hence, for every natural 
number n there exists a point x n € S such that \x n — c\ < 1/n but \f (x n ) — / (c) | > £q. Clearly, 
the sequence {x n } converges to c since \x n — c\ < 1/n. On the other hand, the sequence {/ (x n )} 
cannot be converging to / (c) , because |/ (x n ) — / (c) | is always > e$. 

This completes the proof of the theorem. 



3.5 Continuity and Topology 

Let / : S — > T be a function, and let A be a subset of the codomain T. Recall that / _1 (A) denotes the 
subset of the domain S consisting of all those x € S for which / (x) € A. 

Our original definition of continuity was in terms of e's and S's. Theorem 3.4, p. 62 established an 
equivalent form of continuity, often called "sequential continuity," that involves convergence of sequences. 
The next result shows a connection between continuity and topology, i.e., open and closed sets. 

Theorem 3.5: 

1. Suppose S is a closed subset of C and that / : S — > C is a complex- valued function on S. 
Then / is continuous on S if and only if / _1 (A) is a closed set whenever A is a closed subset 
of C. That is, / is continuous on a closed set S if and only if the inverse image of every closed 
set is closed. 

2. Suppose U is an open subset of C and that / : U — > C is a complex-valued function on U. 
Then / is continuous on U if and only if / _1 (A) is an open set whenever A is an open subset 
of C. That is, /is continuous on an open set U if and only if the inverse image of every open 
set is open. 

Proof: 

Suppose first that / is continuous on a closed set S and that A is a closed subset of C. We wish to 
show that / _1 (A) is closed. Thus, let {x n } be a sequence of points in / _1 (A) that converges to a 
point c. Because S is a closed set, we know that c € S, but in order to see that / _1 (A) is closed, we 
need to show that c e / _1 (A) . That is, we need to show that / (c) e A. Now, / (x n ) € A for every 
n, and, because / is continuous at c, we have by Theorem 3.4, p. 62 that / (c) = limf (x n ) . Hence, 
/ (c) is a limit point of A, and so / (c) G A because A is a closed set. Therefore, c s f^ 1 (A) , and 
f^ 1 (A) is closed. 

Conversely, still supposing that S is a closed set, suppose / is not continuous on S, and let c be 
a point of S at which / fails to be continuous. Then, there exists an e > and a sequence {x n } 
of elements of S such that c = limx n but such that \f (c) — / (x n ) \ > e for all n. (Why? See the 
proof of Theorem 3.4.) Let A be the complement of the open disk B £ (/ (c)) . Then A is a closed 
subset of C. We have that / (x n ) € A for all n, but / (c) is not in A. So, x n € / _1 (A) for all n, but 



5 This content is available online at <http://cnx.Org/content/m36152/l.2/>. 
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c = limx n is not in f~ x (A) . Hence, / _1 (A) does not contain all of its limit points, and so f^ 1 (A) 
is not closed. Hence, if / is not continuous on S, then there exists a closed set A such that f~ x (A) 
is not closed. This completes the proof of the second half of part (1). 

Next, suppose U is an open set, and assume that / is continuous on U. Let A be an open set 
in C, and let c be an element of f~ l (A) . In order to prove that/ -1 (A) is open, we need to show 
that c belongs to the interior of / _1 (A) . Now, / (c) s A, A is open, and so there exists an e > 
such that the entire disk B e (/ (c)) C A. Then, because / is continuous at the point c, there exists 
a 5 > such that if |ar — c| < 6 then \f (x) — /(c) | < e. In other words, if x € B$ (c) , then 
/ (x) g B e (/ (c)) C A. This means that B$ (c) is contained in / _1 (A) , and hence c belongs to the 
interior of / _1 (A) . Hence, if / is continuous on an open set U, then f~ l (A) is open whenever A 
is open. This proves half of part (2). 

Finally, still assuming that U is open, suppose f~ l (A) is open whenever A is open, let c be a 
point of S, and let us prove that / is continuous at c. Thus, let e > be given, and let A be the 
open set A = B e (/ (c)) . Then, by our assumption, / _1 (A) is an open set. Also, c belongs to this 
open set / _1 (A) , and hence c belongs to the interior of / _1 (A) . Therefore, there exists a 6 > 
such that the entire disk b$ (c) C / _1 (A) . But this means that if e S satisfies \x — c\ < 6, then 
xeB s (c) C /- 1 (A) , and so / (x) € A = B e (/ (c)) . Therefore, if \x-c\ < 6, then \f (x)-f (c) | < s, 
which proves that / is continuous at c, and the theorem is completely proved. 



3.6 Deeper Analytic Properties of Continuous Functions 6 

We collect here some theorems that show some of the consequences of continuity. Some of the theorems 
apply to functions either of a real variable or of a complex variable, while others apply only to functions of 
a real variable. We begin with what may be the most famous such result, and this one is about functions of 
a real variable. 

Theorem 3.6: Intermediate Value Theorem 

If / : [a, b] — » R is a real-valued function that is continuous at each point of the closed interval 
[a, b] , and if v is a number (value) between the numbers / (a) and / (b) , then there exists a point 
c between a and b such that / (c) = v. 
Proof: 

If v = f (a) or / (b) , we are done. Suppose then, without loss of generality, that f (a) < v < f (b) . 
Let S be the set of all x € [a, b] such that / (x) < v, and note that S is nonempty and bounded 
above, (a € S, and b is an upper bound for S.) Let c = supS. Then there exists a sequence {x n } 
of elements of S that converges to c. (See Exercise 2.20.) So, / (c) = limf (x n ) by Theorem 3.4, p. 
62. Hence, / (c) < v. (Why?) 

Now, arguing by contradiction, if / (c) < v, let e be the positive number v — f (c) . Because / is 
continuous at c, there must exist a 5 > such that \f (y) — / (c) | < e whenever \y — c\ < 5 and y g 
[a, b] . Since any smaller 5 satisfies the same condition, we may also assume that 5 < b— c. Consider 
y = c+S/2. Then y e [a, b] , \y — c\ < 5, and so \f (y) — / (c) | < e. Hence / (y) < f (c)+e = v, which 
implies that y s S. But, since c = supS,c must satisfy c > y = c+ 6/2. This is a contradiction, so 
/ (c) = v, and the theorem is proved. 

The Intermediate Value Theorem tells us something qualitative about the range of a continuous function 
on an interval [a, b] . It tells us that the range is "connected;" i.e., if the range contains two points c and d, 
then the range contains all the points between c and d. It is difficult to think what the analogous assertion 
would be for functions of a complex variable, since "between" doesn't mean anything for complex numbers. 
We will eventually prove something called the Open Mapping Theorem in Section 7.6 that could be regarded 
as the complex analog of the Intermediate Value Theorem. 



6 This content is available online at <http://cnx.Org/content/m36167/l.2/>. 
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The next theorem is about functions of either a real or a complex variable. 

Theorem 3.7: 

Let / : S — > C be a continuous function, and let C be a compact (closed and bounded) subset of 
S. Then the image / (C) of C is also compact. That is, the continuous image of a compact set is 
compact. 
Proof: 

First, suppose / (C) is not bounded. Thus, let {x n } be a sequence of elements of C such that, 
for each n,\f (x„) | > n. By the Bolzano- Weierstrass Theorem, the sequence {x„} has a convergent 
subsequence {x nk }. Let x = limx nk . Then x e C because Cis a closed subset of C. Co, / (x) = 
limf (x nk ) by Exercise 2.20. But since |/(x nfc )| > rik, the sequence {/ (x nk )} is not bounded, 
so cannot be convergent. Hence, we have arrived at a contradiction, and the set / (C) must be 
bounded. 

Now, we must show that the image / (C) is closed. Thus, let y be a limit point of the image / (C) 
of C, and let y = limy n where each y n e / (C) . For each n, let i„gC satisfy / (x n ) = y n . Again, 
using the Bolzano- Weierstrass Theorem, let {i„J be a convergent subsequence of the bounded 
sequence {x n }, and write x = limx nk . Then x € C, since C is closed, and from Exercise 2.20 

y = Umf (x n ) = limf (x nk ) = f (x) , (3.30) 

showing that y e / (C) , implying that / (C) is closed. 

This theorem tells us something about the range of a continuous function of a real or complex variable. 
It says that if a subset of the domain is closed and bounded, so is the image of that subset. 

The next theorem is about continuous real-valued functions of a complex variable, and it is one of the 
theorems to remember. 

Theorem 3.8: 

Let / be a continuous real-valued function on a compact subset S of C. Then / attains both 
a maximum and a minimum value on S. That is, there exist points z\ and z 2 in S such that 
f (zi) < f (z) < f (to) far all z € S. 
Proof: 

We prove that / attains a maximum value, leaving the fact that / attains a minimum value to the 
exercise that follows. Let M be the supremum of the set of all numbers / (x) for x € S. (How do we 
know that this supremum exists?) We will show that there exists an z 2 e S such that / (z 2 ) = M . 
This will finish the proof, since we would then have / (z 2 ) = Mq > f (z) for all z € S. Thus, let {y n } 
be a sequence of elements in the range of / for which the sequence {y n } converges to Mo- (This 
is Exercise 2.20 again.) For each n, let x n be an element of S such that y n = f (x n ) . Then the 
sequence {/(x n )} converges to Mo. Let {i n J be a convergent subsequence of {x n }. (How?) Let 
z 2 = limx nk . Then z 2 € S, because S is closed, and f (z 2 ) = limf (x nk ) , because / is continuous. 
Hence, / (z 2 ) = Mo, as desired. 

Exercise 3.11 

a. Prove that the / of the preceding theorem attains a minimum value on S. 

b. Give an alternate proof of Theorem 3.8, p. 65 by using Theorem 3.7, p. 65, and then proving 
that a closed and bounded subset of R contains both its supremum and its infimum. 

c. Let S be a compact subset of C, and let c be a point of C that is not in S. Prove that there is a 
closest point to c in S. That is, show that there exists a point w € S such that \w — c\ < \z — c\ 
for all points z € S. HINT: The function z — » \z — c\ is continuous on the set S. 

Exercise 3.12 

Let / : [a, b] — > R be a real-valued function that is continuous at each point of [a, b] . 
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a. Prove that the range of / is a closed interval [a , b'~\ . Show by example that the four numbers 
f (a) , f (b) ,o' and b' can be distinct. 

b. Suppose / is 1-1. Show that, if c is in the open interval (o, 6) , then / (c) is in the open interval 

We introduce next a different kind of continuity called uniform continuity. The difference between regular 
continuity and uniform continuity is a bit subtle, and well worth some thought. 

Definition 3.8: 

A function / : S — > C is called uniformly continuous on S if for each positive number e, there 
exists a positive number 5 such that \f (x) — f (y) \ < e for all x,y e S satisfying \x — y\ < 5. 

Basically, the difference between regular continuity and uniform conintuity is that the same 5 works for 
all points in S. 

Here is another theorem worth remembering. 

Theorem 3.9: 

A continuous complex-valued function on a compact subset S of C is uniformly continuous. 
Proof: 

We argue by contradiction. Thus, suppose / is continuous on S but not uniformly continuous. 
Then, there exists an e > for which no positive number 6 satisfies the uniform continuity definition. 
Therefore, thinking of the <5's as ranging through the numbers 1/n, we know that for each positive 
integer n, there exist two points x n and y n in S so that 

1- \y n ~ x n \ < 1/n, and 
2. \f(y n )-f(x n )\>e. 

Otherwise, some 1/n would suffice for a S. Let {x Uk } be a convergent subsequence of {x n } with 
limit x. By (1) and the triangle inequality, we deduce that x is also the limit of the corre- 
sponding subsequence {y Uk } of {y n }. But then f (x) = limf (x nk ) = limf (y„J , implying that 
= lim\f (yn fc ) — / i x n k ) \, which implies that \f (y nk ) — f {x nk ) \ < e for all large enough k. But 
that contradicts (2), and this completes the proof. 

Continuous functions whose domains are not compact sets may or may not be uniformly continuous, as 
the next exercise shows. 

Exercise 3.13 

a. Let / : (0, 1) — ► R be defined by / (x) = 1/x. Prove that / is continuous at each x in its 
domain but that / is not uniformly continuous there. HINT: Set e = 1, and consider the pairs 
of points x n = 1/n and y n = 1/ (n + 1) . 

b. Let / : [l,oo) — > [l,oo) be defined by / (x) = y/x. Prove that / is not bounded, but is 
nevertheless uniformly continuous on its domain. HINT: Take 5 = e. 

Theorem 3.10: 

Let / : S — > T be a continuous 1-1 function from a compact (closed and bounded) subset of C onto 
the (compact) set T. Let g : T — » S denote the inverse function J" 1 of /. Then g is continuous. 
The inverse of a continuous function, that has a compact domain, is also continuous. 
Proof: 

We prove that g is continuous by using Theorem 3.5, p. 63; i.e., we will show that g _1 (A) is closed 
whenever A is a closed subset of C. But this is easy, since g _1 (A) = g^ 1 (A S) = f (A n S) , and 
this is a closed set by Theorem 3.7, p. 65, because AD S is compact. See part (e) of Exercise 2.24. 
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3.2: 

REMARK Using the preceding theorem, and the exercise below, we will show that taking nth 
roots is a continuous function, that is, the function / defined by / (a;) = x x l n is continuous. 

Exercise 3.14 

Use the preceding theorem to show the continuity of the following functions. 

a. Show that if n is an odd positive integer, then there exists a continuous function g defined on 
all of R such that g (x) is an nth root of x for all real numbers x. That is, (g (x)) n = x for all 
real x. (The function / (x) = x n is 1-1 and continuous.) 

b. Show that if n is any positive integer then there exists a unique continuous function g defined 
on [0, oo ) such that g (x) is an nth root of x for all nonnegative x. 

c. Let r = p/q be a rational number. Prove that there exists a continuous function g : [0, oo) — » 
[0, oo) such that g(x) q = x p for all x > 0; i.e., g (x) = x r for all x > 0. 

Theorem 3.11: 

Let / be a continuous 1-1 function from the interval [a, b] onto the interval [c, d] . Then / must be 
strictly monotonic, i.e., strictly increasing everywhere or strictly decreasing everywhere. 
Proof: 

Since / is 1-1, we clearly have that f (a) / / (6) , and, without loss of generality, let us assume 
that c= f (a) < f (b) = d. It will suffice to show that if a and (3 belong to the open interval (a, b) , 
and a < (3, then / (a) < f ((3) . (Why will this suffice?) Suppose by way of contradiction that there 
exists a < (3 in (a, b) for which / (a) > f (j3) . We use the intermediate value theorem to derive 
a contradiction. Consider the four points a < a < (3 < b. Either / (a) < f (a) or / (f3) < / (b) . 
(Why?) In the first case (/(a) < /(a)), f([a,a\) contains every value between f (a) and / (a) . 
And, / ([a, 0\) contains every value between / (a) and / ((3) . So, let v be a number such that 
/ (a) < v,f (f3) < v, and v < f (a) (why does such a number v exist?). By the Intermediate Value 
Theorem, there exists x\ s (a, a) such that v = f (x\) , and there exists an x-i € {a, (3) such that 
v = f (X2) ■ But this contradicts the hypothesis that / is 1-1, since x\ / xi. A similar argument 
leads to a contradiction in the second case / (/?) < f (b) . (See the following exercise.) Hence, there 
can exist no such a and (3, implying that / is strictly increasing on [a, b] . 

Exercise 3.15 

Derive a contradiction from the assumption that / ((3) < f (6) in the preceding proof. 



3.7 Power Series Functions 7 

The class of functions that we know are continuous includes, among others, the polynomials, the rational 
functions, and the nth root functions. We can combine these functions in various ways, e.g., sums, products, 
quotients, and so on. We also can combine continuous functions using composition, so that we know that 
nth roots of rational functions are also continuous. The set of all functions obtained in this manner is called 
the class of "algebraic functions." Now that we also have developed a notion of limit, or infinite sum, we can 
construct other continuous functions. 

We introduce next a new kind of function. It is a natural generalization of a polynomial function. Among 
these will be the exponential function and the trigonometric functions. We begin by discussing functions of 
a complex varible, although totally analogous definitions and theorems hold for functions of a real variable. 

Definition 3.9: 

Let {a„}g° be a sequence of real or complex numbers. By the power series function/ (z) = 
J2^=o a nZ n we mean the function / : S — » C where the domain S is the set of all z € C for which 



7 This content is available online at <http://cnx.Org/content/m36165/l.2/>. 
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the infinite series Y o-nZ n converges, and where / is the rule that assigns to such & z & S the sum 
of the series. 

The numbers {a n } defining a power series function are called the coefficients of the function. 

We associate to a power series function / (z) = Y"^Lo a « z " its sequence {Sn} of partial sums. 
We write 

N 

S N {z) = Y j anZ n . (3.31) 

n=0 

Notice that polynomial functions are very special cases of power series functions. They are the 
power series functions for which the coefficients {a„} are all beyond some point. Note also that 
each partial sum Sn for any power series function is itself a polynomial function of degree less 
than or equal to N. Moreover, if / is a power series function, then for each z in its domain we have 
/ (z) = UttinSn (z) ■ Evidently, every power series function is a "limit" of a sequence of polynomials. 
Obviously, the domain S = Sf of a power series function / depends on the coefficients {a n } 
determining the function. Our first goal is to describe this domain. 

Theorem 3.12: 

Let / be a power series function: / (z) = Y^=o a nZ n with domain S. Then: 

1. belongs to S. 

2. If a number t belongs to S, then every number u, for which |u| < |i|, also belongs to S. 

3. S is a disk of radius r around in C (possibly open, possibly closed, possibly neither, possibly 
infinite). That is, S consists of the disk B r (0) = {z : \z\ < r} possibly together with some of 
the points z for which \z\ = r. 

4. The radius r of the disk in part (3) is given by the Cauchy-Hadamard formula: 

(3.32) 



lira suplanl 1 / 11 ' 

which we interpret to imply that r = if and only if the limsup on the right is infinite, and 
r = oo if and only if that limsup is 0. 

Proof: 

Part (1) is clear. 

To see part 2, assume that t belongs to S and that |u| < \t\. We wish to show that the infinites 
series Y a nU n converges. In fact, we will show that Y \ a nU n \ is convergent, i.e., that Y a n,u n 
is absolutely convergent. We are given that the infinite series Y a nt n converges, which implies 
that the terms a n t n tend to 0. Hence, let B be a number such that |a n 2 n | < B for all n, and 
set a = \u\/\t\. Then a < 1, and therefore the infinite series Y^a n is convergent. Finally, 
|anW n | = |a„t™|a™ < Ba n , which, by the Comparison Test, implies that Y \ a nU n \ is convergent, as 
desired. 

Part (3) follows, with just a little thought, from part 2. 

To prove part (4), note that lira suplan] 1 ^ 71 either is finite or it is infinite, assume first that the 
sequence {lanl 1 ^" - } is not bounded; i.e., that Urn suplcin] 1 / 71 = oo. Then, given any number p, there 
are infinitely many terms lanl 1 /™ that are larger than p. So, for any z / 0, there exist infinitely 
many terms la,^ 1 /" that are larger than l/\z\. But then |a n z™| > 1 for all such terms. Therefore the 
infinite series Y a nZ n is not convergent, since lima n z n is not zero. So no such z is in the domain 
S. This shows that if lira suplan] 1 ^ = oo, then r = = 1/lira sup^^ 1 / 11 . 

Now, suppose the sequence {lanl 1 ^™} is bounded, and let L denote its limsup. We must show that 
1/r = L. We will show the following two claims: (a) if l/\z\ > L, then z € S, and (b) if l/\z\ < L, 
then z $. S. (Why will these two claims complete the proof?) Thus, suppose that l/\z\ > L. Let 
/3 be a number satisfying L < j3 < l/\z\, and let a = (3\z\. Then < a < 1. Now there exists a 
natural number N so that lonj 1 '" < (3 for all n > N, or equivalently \a n \ < (3 n for all n > N. (See 
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part (a) of Exercise 2.17. ) This means that for all n > N we have |a„z™| = |a n //3"||/3,z|" < a n . 
This implies by the Comparison Test that the power series Yl «nz" is absolutely convergent, whence 
convergent. Hence, z € S, and this proves claim (a) above. Incidentally, note also that if L = 0, 
this argument shows that r = oo, as desired. 

To verify claim (b), suppose that \/\z\ < L. Then there are infinitely many terms of the sequence 
llanl 1 ^"} that are greater than l/\z\. (Why?) For each such term, we would then have \a n z n \ > 1. 
This means that the infinite series Yl a>nZ n is not convergent and z £ S, which shows claim b. 

Hence, in all cases, we have that r = 1/lim suplcin] 1 '™, as desired. 

Definition 3.10: 

If / is a power series function, the number r of the preceding theorem is called the radius of 
convergence of the power series. The disk S of radius r around 0, denoted by B r (0) , is called the 
disk of convergence. 

Exercise 3.16 

Compute directly the radii of convergence for the following power series functions, i.e., without 
using the Cauchy-Hadamard formula. Then, when possible, verify that the Cauchy-Hadamard 
formula agrees with your computation. 



a. 
b. 
c. 

d. 

0. 


f ( z) = y^ 1 n z n . 

/(*) = £(-l) n (l/(n + l))* 

/ (*) = £(V (n+1)) z 3n+1 . 


cei 


rcise 3.17 



a. Use part (e) of Exercise 3.1 to show that a power series function p is an even function if and 
only if its only nonzero coefficients are even ones, i.e., the a2fc's. Show also that a power series 
function is an odd function if and only if its only nonzero coefficients are odd ones, i.e., the 

fl2fc+l's. 

b. Suppose / (z) = J2kLo a 2kZ 2k is a power series function that is an even function. Show that 

oo 

f(iz) = Y J (-V k *2 k z 2k = r(z), (3.33) 

fe=0 

where f a is the power series function obtained from / by alternating the signs of its coeffi- 
cients. We call this function f a the alternating version of /. 

c. If g (z) = J2T=o a 2k+iz 2k+1 is a power series function that is an odd function, show that 



oo 

g (iz) = i V (-l) k a 2k+1 z 2k+1 = ig a (z) , (3.34) 



k=0 



where again g a is the power series function obtained from g by alternating the signs of its 
coefficients, 
d. If / is any power series function, show that 

/ (iz) = U (iz) + fo {iz) = f a e (z) + i/« (z) , (3.35) 

and hence that f e (iz) = /" (z) and f (iz) = ip a (z) . 

The next theorem will not come as a shock, but its proof is not so simple. 
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Theorem 3.13: 

Let / (z) = J2 a nZ n be a power series function with radius of convergence r. Then / is continuous 
at each point in the open disk B r (0) , i.e., at each point z for which \z\ < r. 
Proof: 

Let z € B r (0) be given. We must make some auxiliary constructions before we can show that / is 
continuous at z. First, choose a z such that \z\ < \z'\ < r. Next, set b n = \na n \, and define g(z) = 
J2b n z n - By the Cauchy-Hadamard formula, we see that the power series function g has the same 
radius of convergence as the power series function /. Indeed, lira suplbn] 1 ' 71 = lira supn 1 ' n \a n \ = 
Umn x l n lim sup\a n \. Therefore, z belongs to the domain of g. Let M be a number such that each 
partial sum of the series g (z) = J2 n =o ^ nZ ' ls bounded by M. 

Now, let e > be given, and choose 6 to be the minimum of the two positive numbers e\z'\/M 
and \z'\ — \z\. We consider any y for which \y — z\ < S. Then y G B r (0) ,\y\ < \z'\, and 

\f(y)-f(z)\ = lim\S N (y)-S N (z)\ 

Hm\Y,n=O a n{y n - Z n )\ 

< UrnZLoK\\y n - z n \ 

= HmlL,n=i \ a n\\y - z\J2]Zn |y j lk n_1_j l 

< lim 1 £" =1 \a n \\y-z\Y%ZoW\ n ~ i (3.36) 

N 



< Um\y- z\(l/\z'\)Y, n =o n \ a n\\ z 



< 



\y — zllimMr 



N 

\y - zY 

N 

< Slim^4r 

N I 2 I 

< £. 



This completes the proof. 

Exercise 3.18 

a. Let / (z) = J]^Lo a nZ n be a power series function, and let p (z) = J^fcLo bkz k be a polynomial 
function. Prove that f + p and fp are both power series functions. Express the coefficients 
for f + p and fp in terms of the o n 's and bk's. 

b. Suppose / and g are power series functions. Prove that f + g is a power series function. What 
is its radius of convergence? What about cfl What about fgl What about f/g? What about 
l/l? 

Exercise 3.19 

a. Prove that every polynomial is a power series function with infinite radius of convergence. 

b. Prove that 1/z and (1/ (z — 1) (z + 2)) are not power series functions. (Their domains aren't 
right.) 

c. Define / (z) = J^^Lo (— l)™^ 2n+1 - Prove that the radius of convergence of this power series 
function is 1, and that / (z) = 1 * 2 for all z e Bx (0) . Conclude that the rational function 
zj (l + z 2 ) agrees with a power series function on the disk B\ (0) . But, they are not the same 
function. HINT: Use the infinite geometric series. 

Theorem 3.13, p. 69 and Exercise 3.18 and Exercise 3.19 raise a very interesting and subtle point. Suppose 
f (z) = J2 a nZ n is a power series function having finite radius of convergence r > 0. Theorem 3.13, p. 69 
says that / is continuous on the open disk, but it does not say anything about the continuity of / at points 
on the boundary of this disk that are in the domain of /, i,e., at points z for which \z \ = r. and J2 a n z o 
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converges. Suppose g (z) is a continuous function whose domain contains the open disk B r (0) and also a 
point zo, and assume that / (z) = g (z) for all z € B r (0) . Does / (zq) have to agree with g (z^)l It's worth 
some thought to understand just what this question means. It amounts to a question of the equality of two 
different kinds of limits. / (zo) is the sum of an infinite series, the limit of a sequence of partial sums, while, 
because g is continuous at zo,g (^o = Hrn z ^ Zo g (z) . At the end of this chapter, we include a theorem of Abel 
that answers this question. 

The next theorem is the analog for power series functions of part (2) of Theorem 3.1, p. 57 for polynomials. 
We call it the "Identity Theorem," but it equally well could be known as the "Uniqueness of Coefficients 
Theorem," for it implies that different coefficients mean different functions. 

Theorem 3.14: Identity Theorem 

Let f (z) =Y1 a nZ n be a power series function with positive radius of convergence r. Suppose {zk} 
is a sequence of nonzero distinct numbers in the domain of / such that: 

1. limzk = 0. 

2. / ( Zk ) = for all k. 

Then / is identically (/ (z) = for all z & S). Moreover, each coefficient o„ of / equals 0. 
Proof: 

Arguing by induction on n, let us prove that all the coefficients a n are 0. First, since / is continuous 
at 0, and since limzk = 0, we have that oo, which equals / (0) , = limf (zk) = 0. 
Assume then that ao = a\ = ... = a„_i = 0. Then 

f(z) = a n z n + a n+1 z n ^ + ... 
= z n V°° bz j 

where bj = a n +j. If g is the power series function defined by g (z) = Y^ bjZ^ , then, by the Cauchy- 
Hadamard Formula, we have that the radius of convergence for g is the same as that for /. (Why 
does lira suplbj] 1 ^ = lim sup\ak\ ?) We have that / (z) = z n g(z) for all z in the common disk 
of convergence of these functions / and g. Since, for each k, Zk / and f (zk) = z k9{ z k) = 0, it 
follows that g (zf.) = for every k. Since g is continuous at 0, it then follows as in the argument 
above that g (0) = 0. But, g (0) = &o = o-n- Hence a n = 0, and so by induction all the coefficients of 
the power series function / are 0. Clearly this implies that / (z) is identically 0. 

Rule 3.1: 

Suppose / and g are two power series functions, that {zu} is a sequence of nonzero points that 
converges to 0, and that / (zk) = 9 {zk) for all k. Then / and g have the same coefficients, the same 
radius of convergence, and hence / (z) = g (z) for all z in their common domain. 

Exercise 3.20 

a. Prove the preceding corollary. (Compare with the proof of Theorem 3.1, p. 57.) 

b. Use the corollary, and the power series function g (z) = z, to prove that / (z) = \z\ is not a 
power series function. 

c. Show that there are power series functions that are not polynomial functions. 

d. Let / (z) = Yl a nZ n be a power series function with infinite radius of convergence, all of 
whose coefficients are positive. Show that there is no rational function r = p/q for which 
f (z) = r (z) for all complex numbers z. Conclude that the collection of power series functions 
provides some new functions. HINT: Use the fact that for any n we have that / (x) > a n x n 
for all positive x. Then, by choosing n appropriately, derive a contradiction to the resulting 
fact that \p(x) /q(x) | > a n x n for all positive x. See part (b) of Exercise 3.2. 
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3.8 The Elementary Transcendental Functions 8 

Having introduced a class of new functions (power series functions) , we might well expect that some of these 
will have interesting and unexpected properties. So, which sets of coefficients might give us an exotic new 
function? Unfortunately, at this point in our development, we haven't much insight into this question. It 
is true, see Exercise 3.16, that most power series functions that we naturally write down have finite radii of 
convergence. Such functions may well be new and fascinating, but as a first example, we would prefer to 
consider a power series function that is defined everywhere, i.e., one with an infinite radius of convergence. 
Again revisiting Exercise 3.16, let us consider the coefficients a n = 1/nl. This may seem a bit ad hoc, but 
let's have a look. 

Definition 3.11: 

Define a power series function, denoted by exp, as follows: 

°° z n 
e xp(z) = ]T— . (3.38) 

We will call this function, with 20-20 hindsight, the exponential function. 

What do we know about this function, apart from the fact that it is defined for all complex numbers? 
We certainly do not know that it has anything to do with the function e z ; that will come in the next chapter. 
We do know what the number e is, but we do not know how to raise that number to a complex exponent. 

All of the exponential function's coefficients are positive, and so by part (d) of Exercise 3.20 exp is not a 
rational function; it really is something new. It is natural to consider the even and odd parts exp e and exp 
of this new function. And then, considering the constructions in Exercise 3.17, to introduce the alternating 
versions exp a e and exp a of them. 

Definition 3.12: 

Define two power series functions cosh (hyperbolic cosine) and sinh (hyperbolic sine) by 

, . , exp (z) + exp {— z) , . , . . exp(z) — exp(z) . „ . 

cosh{z) = y ' — and sinh(z) = y ' ^- L - L , (3.39) 

and two other power series functions cos (cosine) and sin (sine) by 

i\ uc \ <^xp {iz) + exp {-iz) 
cos (z) = cosh (iz) = (3.40) 

and 

, n exp (iz) — exp (— iz) 
sin (z) = -isinh {iz) = Fy ' — ^ '-. (3.41) 

The five functions just defined are called the elementary transcendental functions, the sinh and 
cosh functions are called the basic hyperbolic functions, and the sine and cosine functions are called 
the basic trigonometric or circular functions. The connections between the hyperbolic functions 
and hyperbolic geometry, and the connection between the trigonometric functions and circles and 
triangles, will only emerge in the next chapter. From the very definitions, however, we can see a 
connection between the hyperbolic functions and the trigonometric functions. It's something like 
interchanging the roles of the real and imaginary axes. This is probably worth some more thought. 

Exercise 3.21 



3 This content is available online at <http://cnx.Org/content/m36160/l.2/>. 
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a. Verify the following equations: 



= 1 + z +^ + l^ + -+TT + -, (3-42) 

= cosh (z) + sinh (z) . 



sin(z) = z-^ + ^-^ + .^ + i-lfj^yl 

— v^°° ( -\\ k z 2fc + 1 

— l^k=a\ 1 ) (2fc+i)p 
cos(z) = l-^ + ^-^ + ... + (-l) k 1 g fJ + 

— v^°° ( -\\ k Z 2> ° 

— l^k=0\ 1 ) (2fe)!' 

00 z 2k+l 



(3.43) 
(3.44) 



and 



sinh (z) = V -, (3.45) 

K) Aj(2fc + 1)!' v ^ 

00 2fc 

fc=0 v '' 

(These expressions for the elementary transcendental functions are perhaps the more familiar 
ones from a calculus course.) 

b. Compute the radii of convergence for the elementary transcendental functions. HINT: Do not 
use the Cauchy-Hadamard formula. Just figure out for which z's the functions are defined. 

c. Verify that exp (0) = l,sin (0) = sinh (0) = 0, and cos (0) = cosh (0) = 1. 

d. Prove that all five of the elementary transcendental functions are not rational functions. 

e. Can you explain why sin 2 (z) + cos 2 (z) = 1? What about the " Addition Formula" 

sin (z + w) = sin (z) cos (w) + cos (z) sin (w) . (3.47) 

Exercise 3.22 

a. Show that the elementary transcendental functions map real numbers to real numbers. That 
is, as functions of a real variable, they are real-valued functions. 

b. Show that the exponential function exp is not bounded above. Show in fact that, for each 
nonnegative integer n,exp (x) jx n is unbounded. Can you show that exp(x) = e x l What, in 
fact, does e x mean if x is an irrational or complex number? 

At this point, we probably need a little fanfare! 

Theorem 3.15: THEOREM 3.14159 (Definition of n) 

There exists a smallest positive number x for which sin (x) = 0. We will denote this distinguished 
number x by the symbol w. 
Proof: 

First we observe that sin(l) is positive. Indeed, the infinite series for sin{\) is alternating. It 
follows from the alternating series test (Theorem 2.18) that sin (1) > 1 — 1/6 = 5/6. 
Next, again using the alternating series test, we observe that sin (4) < 0. Indeed, 

43 45 47 ^9 
sin (4) < 4 - - + - - - + - w -0.4553 < 0. (3.48) 

Hence, by the intermediate value theorem, there must exist a number c between 1 and 4 such that 
sin (c) = 0. So, there is at least one positive number x such that sin (x) = 0. However, we must 
show that there is a smallest positive number satisfying this equation. 
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Let A be the set of all x > for which sin (x) = 0. Then A is a nonempty set of real numbers 
that is bounded below. Define n = infA. We need to prove that sin (ir) = 0, and that 7r > 0. 
Clearly then it will be the smallest positive number x for which sin (x) = 0. 

By Exercise 2.20, there exists a sequence {xk} of elements of A such that n = limxk- Since sin 
is continuous at ir, it follows that sin (it) = limsin (xk) = limO = 0. Finally, if ir were equal to 0, 
then by the Identity Theorem, Theorem 3.14, we would have that sinx = for all x. Since this is 
clearly not the case, we must have that n > 0. 

Hence, -k is the smallest (minimum) positive number x for which sin [x) = 0. 

As hinted at earlier, the connection between this number 7r and circles is not at all evident at the moment. 
For instance, you probably will not be able to answer the questions in the next exercise. 

Exercise 3.23 

a. Can you see why sin (x + 2ir) = sin (x)l That is, is it obvious that sin is a periodic function? 

b. Can you prove that cos (ir) = — 1? 

3.3: 
REMARK Defining it to be the smallest positive zero of the sine function may strike many people 
as very much "out of the blue." However, the zeroes of a function are often important numbers. 
For instance, a zero of the function x 2 — 2 is a square root of 2, and that number we know was 
exztremely important to the Greeks as they began the study of what real numbers are. A zero 
of the function z 2 + 1 is something whose square is -1, i.e., negative. The idea of a square being 
negative was implausible at first, but is fundamental now, so that the zero of this particular function 
is critical for our understanding to numbers. Very likely, then, the zeroes of any "new" function 
will be worth studying. For instance, we will soon see that, perhaps disappointingly, there are no 
zeroes for the exponential function: exp (z) is never 0. Maybe it's even more interesting then that 
there are zeroes of the sine function. 

The next theorem establishes some familiar facts about the trigonometric functions. 

Theorem 3.16: 

1. exp (iz) = cos (z) + isin (z) for all zGC. 

2. Let {zk} be a sequence of complex numbers that converges to 0. Then 

sin (zi.) 
Urn y — = 0. (3.49) 

3. Let {zk} be a sequence of complex numbers that converges to 0. Then 

l-cos(z k ) 1 
hm 2 = q- (3.50) 

z k 2 

Exercise 3.24 

Prove Theorem 3.16, p. 74. 

HINT: For parts (2) and (3), use Theorem 3.13, p. 69. 
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3.9 Analytic Functions and Taylor Series 9 

Definition 3.13: 

Let S be a subset of C, let / : S — > C be a complex-valued function, and let c be a point of S. 
Then / is said to be expandable in a Taylor series around c with radius of convergence r if there 
exists an r > such that B r (c) C S, and / (z) is given by the formula 



oc 



/(z) = ^a„(^-c) n (3.51) 

n=0 

for all z & B r (c) . 

Let S be a subset of R, let / : S — > i? be a real- valued function on S, and let c be a point of 5. 
Then / is said to be expandable in a Taylor series around c with radius of convergence r if there 
exists an r > such that the interval (c — r, c + r) C S, and / (x) is given by the formula 

oo 

f{x) = Y j a n {x-c) n (3.52) 

71=0 

for all x € (c — r, c + r) . 

Suppose S is an open subset of C. A function / : S — > C is called analytic on S if it is expandable 
in a Taylor series around every point c of S 1 . 

Suppose 5 is an open subset of R. A function / : S — > C is called reai analytic on S if it is 
expandable in a Taylor series around every point c of S. 

Theorem 3.17: 

Suppose S is a subset of C, that / : S — » C is a complex-valued function and that c belongs to S. 
Assume that / is expandable in a Taylor series around c with radius of convergence r. Then / is 
continuous at each z £ B r (c) . 

Suppose S is a subset of R, that / : S — > R is a real-valued function and that c belongs to S. 
Assume that / is expandable in a Taylor series around c with radius of convergence r. Then / is 
continuous at each x € (c — r, c + r) . 
Proof: 

If we let g be the power series function given by g (z) = Yl a nZ n , and T be the function defined 
by T (z) = z — c, then f (z) = g (T (z)) , and this theorem is a consequence of Theorem 3.3, The 
composition of continuous functions is continuous., p. 61 and Theorem 3.13, p. 69. 

Exercise 3.25 

Prove that / (z) = 1/ z is analytic on its domain. 

HINT: Use r = \c\, and then use the infinite geometric series. 

Exercise 3.26 

State and prove an Identity Theorem, analogous to Theorem 3.14, Identity Theorem, p. 71, for 
functions that are expandable in a Taylor series around a point c. 

Exercise 3.27 

a. Prove that every polynomial is expandable in a Taylor series around every point c. HINT: 
Use the binomial theorem. 

b. Is the exponential function expandable in a Taylor series around the number —1? 



9 This content is available online at <http://cnx.Org/content/m36168/l.2/>. 
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3.10 Uniform Convergence 10 

We introduce now two different notions of the limit of a sequence of functions. Let S be a set of complex 
numbers, and let {/„} be a sequence of complex-valued functions each having domain S. 

Definition 3.14: 

We say that the sequence {f n }converges or converges pointwise to a function / : S — > C if for 
every x G S and every e > there exists a natural number N, depending on x and e, such that for 
every n > N,\f n (x) — f (x)\ < e. That is, equivalently, {/ n } converges pointwise to / if for every 
x G S the sequence {/„ (cc)} of numbers converges to the number / (x) . 

We say that the sequence {f n } converges uniformly to a function / if for every e > 0, there 
exists an N, depending only on e, such that for every n > N and every x G S,\f n (x) — f (x) | < e. 
If {u n } is a sequence of functions defined on S, we say that the infinite series Yl u n converges 
uniformly if the sequence {Sn = X^ n =o u «) °f P ai "tial sums converges uniformly. 
These two definitions of convergence of a sequence of functions differ in subtle ways. Study the word 
order in the definitions. 

Exercise 3.28 

a. Prove that if a sequence {/„} of functions converges uniformly on a set 5 to a function / then 
it converges pointwise to /. 

b. Let S = (0, 1) , and for each n define /„ (x) = x n . Prove that {/„} converges pointwise to the 
zero function, but that {/„} does not converge uniformly to the zero function. Conclude that 
pointwise convergence does not imply uniform convergence. HINT: Suppose the sequence 
does converge uniformly. Take e = 1/2, let N be a corresponding integer, and consider x's of 
the form x = 1 — h for tiny h'a. 

c. Suppose the sequence {/„} converges uniformly to / on S, and the sequence {g n } converges 
uniformly to g on S. Prove that the sequence {/„ + g n } converges uniformly to / + g on S. 

d. Suppose {/„} converges uniformly to / on S, and let c be a constant. Show that {c/„} 
converges uniformly to cf on S. 

e. Let S = R, and set /„ (x) = x + (1/n) . Does {/„} converge uniformly on S? Does {/^} 
converge uniformly on 5? What does this say about the limit of a product of uniformly 
convergent sequences versus the product of the limits? 

f. Suppose a and b are nonnegative real numbers and that \a—b\ < e 2 . Prove that \\fa— v6| < 2e. 
HINT: Break this into cases, the first one being when both y/a and \/b are less than e. 

g. Suppose {/„} is a sequence of nonnegative real- valued functions that converges uniformly to 
/ on S. Use part (f) to prove that the sequence {y/f^} converges uniformly to y/J. 

h. For each positive integer n, define /„ on (—1, 1) by /„ (x) = \x\ . Prove that the sequence 

{/„} converges uniformly on (—1,1) to the function f (x) = \x\. HINT: Let e > be given. 
Consider |cc|'s that are < e and |cc|'s that are > s. For |cc| < e, show that |/„ (x) — f (x) | < e 
for all n. For \x\ > e, choose N so that \e x l n — 1| < e. How? 

Exercise 3.29 

Let {f n } be a sequence of functions on a set S, let / be a function on 5, and suppose that for each 
n we have \f (x) — /„ (x) \ < 1/n for all x G S. Prove that the sequence {/„} converges uniformly 
to/. 

We give next four important theorems concerning uniform convergence. The first of these theorems is 
frequently used to prove that a given function is continuous. The theorem asserts that if / is the uniform 
limit of a sequence of continuous functions, then / is itself continuous. 



°This content is available online at <http://cnx.Org/content/m36178/l.2/>. 
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Theorem 3.18: The uniform limit of continuous functions is continuous. 

Suppose {/„} is a sequence of continuous functions on a set S C C, and assume that the sequence 
{/„} converges uniformly to a function /. Then / is continuous on S. 
Proof: 

This proof is an example of what is called by mathematicians a "3e argument." 

Fix anieS and an e > 0. We wish to find a 6 > such that if y e S and \y — x\ < 6 then 
\f(y)-f(x)\<e. 

We use first the hypothesis that the sequence converges uniformly. Thus, given this e > 0, 
there exists a natural number N such that if n > N then \f (z) — f n (z) \ < e/3 for all z € S. 
Now, because /jv is continuous at x, there exists a 6 > such that if y e S and \y — x\ < S then 
|/jv (y) - In {x) I < e/3. So, if y e S and |y - x| < S, then 

|/(y)-/(x)| = \f(y)-f N (y) + f N (y)-f N ( X ) + f N ( X )-f( X )\ 

< 1/ (y) - In (y) I + |/jv (y) - In (x) \ + \f N (x) -f(x)\ 



< 



(3.53) 

3 "^ 3 
£. 



This completes the proof. 



3.4: 

REMARK Many properties of functions are preserved under the taking of uniform limits, e.g., 
continuity, as we have just seen. However, not all properties are preserved under this limit process. 
Differentiability is not, integrability is sometimes, being a power series function is, and so on. We 
must be alert to be aware of when it works and when it does not. 

Theorem 3.19: Weierstrass M-Test 

Let {u n } be a sequence of complex-valued functions defined on a set S C C. Write Sn f° r the 
partial sum Sn (x) = J2 n =o Un ( :E ) ■ Suppose that, for each n, there exists an M n > for which 
\u„ (x) | < M n for all x £ S. Then 

1. If Y^ M n converges, then the sequence {Sn} converges uniformly to a function S. That is, the 
infinite series J^ u n converges uniformly. 

2. If each function u n is continuous, and J2 M n converges, then the function S of part (1) is 
continuous. 

Proof: 

Because Yl M n is convergent, it follows from the Comparison Test that for each x € S the infinite 
series Y^=o u n{ x ) ls absolutely convergent, hence convergent. Define a function S by S (x) = 

Z^=o M « ( x ) = M™Sn (x) ■ 

To show that {Sn} converges uniformly to S, let e > be given, and choose a natural number 
TV such that J2^Ln+i ^ n < £ - This can be done because Yl M n converges. Now, for any x € S 
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and any m > N, we have 



S(x)- 


S m \X) 


= 


Urn S k (x) - S m (x) 

k — >oo 






= 


Urn (S k (x) - S m (x) 

k — >oo 






= 


Urn \S k (x) - S m (x) 

h — >oo 






= 


Km 1 EL+i u n 0) 

fc — ► OO 






< 


Urn Etm+i K 0) 






< 


Jim En=m+1 M « 

K — !-00 






= 


V°° M 






< 


E^M n 






< 


£. 



(3.54) 



This proves part (1). 

Part (2) now follows from part (1) and Theorem 3.18, The uniform limit of continuous functions 
is continuous., p. 76, since the Sn's are continuous. 

Theorem 3.20: 

Let / (z) = E^Lo a n zTl b e a P ower series function with radius of convergence r > 0, and let 
{Sn (z)} denote the sequence of partial sums of this series: 

N 

S N (z) = Y j anZ n . (3.55) 

n=0 

If < r < r, then the sequence {Sn} converges uniformly to / on the diski? r ' (0) . 
Proof: 

Define a power series function g by g (z) = E^L |a„|z", and note that the radius of convergence 
for g is the same as that for /, i.e., r. Choose t so that r < t < r. Then, since t belongs to the disk of 
convergence of the power series function g, we know that E^Lo l a "l^™ converges. Set m n = \a n \t n , 
and note that J2 m n converges. Now, for each z € B r < (0) , we have that 

\a n z n \ < \a n \r' n < \a n \t n = m„, (3.56) 

so that the infinite series ^a n z n converges uniformly on B r ' (0) by the Weierstrass M-Test. 

Exercise 3.30 

Let / (z) = E^Lo z ™- R- ecan that the radius of convergence for / is 1. Verify that the sequence 
{Sn} of partial sums of this power series function fails to converge uniformly on the full open disk 
of convergence By (0) , so that the requirement that r < r is necessary in the preceding theorem. 

The next theorem shows that continuous, real-valued functions on closed bounded intervals are uniform limits 
of step functions. Step functions have not been mentioned lately, since they aren't continuous functions, but 
this next theorem will be crucial for us when we study integration in Section 5.1. 

Theorem 3.21: 

Let / be a continuous real- valued function on the closed and bounded interval [a, b] . Then there 
exists a sequence {h n } of step functions on [a, b] that converges uniformly to /. 
Proof: 

We use the fact that a continuous function on a compact set is uniformly continuous (Theorem 3.9, 
p. 66). 
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For each positive integer n, let S n be a positive number satisfying \f (x) — f (y)\ < 1/n if 
\x— y\ < S n . Such a 8 n exists by the uniform continuity of / on [a, b] . Let P n = {xq < X\ < ... < x mn } 
be a partition of [a, b] for which Xi — cCj_i < d n for all 1 < i < m n . Define a step function h n on 
[a, b] as follows: 

If Xi-\ < x < Xi, then h n (x) = f (<Ej_i) . This defines h n (x) for every x G [a, b) , and we 
complete the definition of h n by setting h n (6) = / (b) . It follows immediately that h n is a step 
function. 

Now, we claim that \f (x) — h n (x) \ < 1/n for all x s [a, b] . This is clearly the case for x = b, 
since / (6) = h n (6) for all n. For any other x, let i be the unique index such that :ej_i < x < #j. 
Then 

|/ (x) - h n (x) \ = \f(x)-f (x t -i) I < 1/n (3.57) 

because \x — £C^_i| < <5„. 

So, we have defined a sequence {h n } of step functions, and the sequence {h n } converges uni- 
formly to / by Exercise 3.29. 

We close this chapter with a famous theorem of Abel concerning the behavior of a power series function 
on the boundary of its disk of convergence. See the comments following Exercise 3.19. 

Theorem 3.22: Abel 

Suppose / (z) = E^Lo a n zU ls a P ower series function having finite radius of convergence r > 0, 
and suppose there exists a point zq on the boundary of B r (0) that is in the domain of /; i.e., 
Y^ a n ZQ converges to / (zq) . Suppose g is a continuous function whose domain contains the open 
disk B r (0) as well as the point zq, and assume that / (z) = g (z) for all z in the open disk B r (0) . 
Then / (zo) must equal g (z ) • 
Proof: 

For simplicity, assume that r = 1 and that zq = 1. See the exercise that follows this proof. Write 
S n for the partial sum of the a n 's: S n = E™=o a «- ^ n ^ ne f°ll° wm g computation, we will use the 
Abel Summation Formula in the form 

N JV-1 

E a n z n = S N z N +J2 S n (*" ~ z n+1 ) ■ (3-58) 

n=0 n=0 

See Exercise 2.30. Let e be a positive number. Then, for any < t < 1 and any positive integer 
N, we have 

\g(l) - /(1)| = \g(l) - f(t) + fit) - En=o«ni" + Eto^ n " /(I) I < (3-59) 
\g{\) - g(t)\ + |/(t) - En=o^"l + IElo«^ n - /(I) I < IsU) - 

g(t)\ + |/(«) - Elo«nt1 + \s N t N + ES^r-O - /(i)| = 
|<? (i) - g (t) | + |; (o - Elo *nt n + \s N t N + El'o 1 (5„ - s N ) (r - r +1 ) + 

ShrE£o 1 (t B -* B+1 ) -/(l)l = 1^(1) ~ </(0l + 1/(0 ~ £to«n* n + 

I E^o 1 (5„ - S*) (r - r +1 ) + s„ (> + E^o 1 (* n - t n+1 )) - f (i) | < k (i) - 

^ (t) I + 1/ (0 - En= «n* B | + I El'o 1 (5« - 5iv) (r - r +1 ) I + |s* - / (1) | < 
\ g (i) - 5 (0I + |/(0 - Elo«n* B l + I £L> (s n - s N ) (r - r +1 ) | + 
I En=P +1 (5« - ^) (i n - i n+1 ) I + \s N - /(i) I < |^(i) -- g(t)\ + 1/(0 ^ 

En=0 «n*1 + I Er=0 ^n ~ S N ) (t n - T +1 ) | + ES +1 |5 n " S N \ (t n - T +1 ) + 

|5iv-/(l)|=ti + t 2 + t3 + *4 + *5- 
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First, choose an integer Mi so that if P and N are both larger than Mi, then t& < e. (The 
sequence {Sk} is a Cauchy sequence, and Yl if k ~ t k+1 is telescoping.) 

Fix such a P > Mi. Then choose a S > so that if 1 > t > 1 — 5, then both t\ and t 3 < e. How? 

Fix such a t. Finally, choose a N, greater than Mi, and also large enough so that both ti and 
£5 are less than e. (How?) 

Now, \g (1) — / (1) I < 5e. Since this is true for every e > 0, it follows that / (1) = g (1) , and 
the theorem is proved. 

Exercise 3.31 

Let f,g,r, and zq be as in the statement of the preceding theorem. Define / (z) = f (zoz) and 
9 (z) = g(z z). 

a. Prove that / is a power series function / (z) = Y^=o b n z n , with radius of convergence equal 
to 1, and such that X^^lo ^™ conver g es to / (1) ; i.e., 1 is in the domain of / . 

b. Show that g is a continuous function whose domain contains the open disk B\ (0) and the 
point z = 1. 

c. Show that, if / (1) =g (1), then f (zq) = g (zo) ■ Deduce that the simplification in the 
preceding proof is justified. 

d. State and prove the generalization of Abel's Theorem to a function / that is expandable in a 
Taylor series around a point c. 



Chapter 4 

Differentiation, Local Behavior 



4.1 Differentiation, Local Behavior E~i7r = -l. 1 

In this chapter we will finally see why e l7r is — 1. Along the way, we will give careful proofs of all the 
standard theorems of Differential Calculus, and in the process we will discover all the familiar facts about 
the trigonometric and exponential functions. At this point, we only know their definitions as power series 
functions. The fact that sin 2 + cos 2 = 1 or that e x+y = e x e v are not at all obvious. In fact, we haven't even 
yet defined what is meant by e x for an arbitrary number x. 
The main theorems of this chapter include: 

1. The Chain Rule (Theorem 4.7, Chain Rule, p. 89), 

2. The Mean Value Theorem (Theorem 4.9, Mean Value Theorem, p. 93), 

3. The Inverse Function Theorem (Theorem 4.10, Inverse Function Theorem, p. 95), 

4. The Laws of Exponents ( and Corollary 4.1, Law of Exponents, p. 97), and 

5. Taylor's Remainder Theorem (Theorem 4.19, Taylor's Remainder Theorem, p. 107). 



4.2 The Limit of a Function 2 

The concept of the derivative of a function is what most people think of as the beginning of calculus. 
However, before we can even define the derivative we must introduce a kind of generalization of the notion 
of continuity. That is, we must begin with the definition of the limit of a function. 

Definition 4.1: 

Let / : S — > C be a function, where S C C, and let c be a limit point of S that is not necessarily 
an element of S. We say that fhas a limit L as z approaches c, and we write 

limf{z) = L, (4.1) 

z — >c 

if for every e > there exists a 6 > such that if z G S and < \z — c\ < 5, then \f (z) — L\ < e. 
If the domain S is unbounded, we say that f has a limit L as z approachesoo, and we write 

L= limf(z), (4.2) 

z — >oo 

if for every e > there exists a positive number B such that if z € S and \z\ > B, then \f (z)—L\ < e. 



1 This content is available online at <http://cnx.Org/content/m36173/l.2/>. 
2 This content is available online at <http://cnx.Org/content/m36185/l.2/>. 
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Analogously, if S C R, we say Um x ^oof ( x ) = L if for every e > there exists a real number 
B such that if x e S and x > B, then |/ (x) — L| < e. And we say that lim x ^-oof ( x ) = L if for 
every e > there exists a real number B such that if x € 5 and x < B, then |/ (x) — L| < e. 

Finally, for / : (a, b) — ► C a function of a real variable, and for c € [a, 6] , we define the one- 
sided (left and right) limits of / at c. We say that / has a left hand limit of L at c, and we write 
L = lim x ^c-of ( x ) > if for every e > there exists a S > such that if x € (a, b) and < c — x < <5 
then |/ (x) — L\ < e. We say that / has a right hand limit of L at c, and write L = lim x ^ c +of i x ) , 
if for every e > there exists a S > such that if x € S 1 and < x — c < 5 then |/ (x) — L\ < e. 

The first few results about limits of functions are not surprising. The analogy between functions having 
limits and functions being continuous is very close, so that for every elementary result about continuous 
functions there will be a companion result about limits of functions. 

Theorem 4.1: 

Let c be a complex number. Let / : S — > C and g : S — > C be functions. Assume that both /and 
g have limits as x approaches c. Then: 

1. There exists a 5 > and a positive number M such that if z G S and < \z — c\ < 5 then 

|/ (z) | < M. That is, if / has a limit as z approaches c, then / is bounded near c. 
2. 

lira (/ (z) + g (z)) = !im/ (z) + limg (z) . (4-3) 



Urn (/ (z) g (z)) = lim/ (z) Zzm<7 (z) . (4-4) 

2 — *C 2 — *C 2 — »c 

4. If lim z ^ c g (z) / 0, then 

f (z) lim z ->rf (z) 
l im i±l = ~ c ^ , 4.5 

z-»c 5 (z) hm z -> c g(z) 

5. If u and u are the real and imaginary parts of a complex-valued function /, then u and v have 
limits as z approaches c if and only if / has a limit as z approaches c. And, 

limf (z) = limu (z) + ilimv (z) . (4-6) 

2 — >C Z — >C Z — >C 

Exercise 4.1 

a. Prove Theorem 4.1, p. 82. HINT: Compare with Theorem 3.2, p. 60. 

b. Prove that lim x ^ c f (x) = L if and only if, for every sequence {x„} of elements of S that 
converges to c, we have limf (x n ) = L. HINT: Compare with Theorem 3.4, p. 62. 

c. Prove the analog of Theorem 4.1, p. 82 replacing the limit as z approaches c by the limit as 
z approaches oo. 

Exercise 4.2 

a. Prove that a function / : S — > C is continuous at a point c of S if and only if lim x -* c f ( x ) = 
/ (c) . HINT: Carefully write down both definitions, and observe that they are verbetim the 
same. 

b. Let / be a function with domain S, and let c be a limit point of S that is not in S. Suppose 
g is a function with domain S U {c}, that / (x) = g (x) for all x € S, and that g is continuous 
at c. Prove that lim x ^ c f (x) = g (c) . 

Exercise 4.3 

Prove that the following functions / have the specified limits L at the given points c. 
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a. 


f(x) = 


--(x 3 


-8)/(* 2 


-4). 


c = 


= 2. 


and L = 


- 3. 


b. 


/(*) = 


--{x 2 


+ 1) / (x 3 


+ 1). 


c = 


= 1. 


and L = 


: 1. 


c. 


/(*) = 


= (x 8 


-l)/(x 6 


+ 1). 


c = 


= i, 


and L = 


-4/3 



d. / (a;) = (sin (x) + cos (x) — exp (xj) / (x 2 ) , c = 0, and L = — 1. 

Exercise 4.4 

Define / on the set S of all nonzero real numbers by / (x) = c if x < and / (x) = d if x > 0. 
Show that linix^of (x) exists if and only if c = d. 

(b) Let / : (a, b) — > C be a complex-valued function on the open interval (a, 6) . Suppose c is a 
point of (a, b) . Prove that lim x ^ c f (x) exists if and only if the two one-sided limits Z«m x _>c-o/ (x) 
and lirrix^c+of (x) exist and are equal. 

Exercise 4.5: Change of variable in a limit 

Suppose / : S — > C is a function, and that lim x ^ c f (x) = L. Define a function g by g (y) = 

f(y + c). 

a. What is the domain of gl 

b. Show that is a limit point of the domain of g and that lim y ^og (y) = lim x ^ c f (x) . 

c. Suppose T C C, that h : T — > 5, and that Um y ^dh (y) = c. Prove that 

ton/ (/i (y)) = limf (x) = L. (4.7) 

4.1: 

REMARK When we use the word " interior" in connection with a set S, it is obviously important 
to understand the context; i.e., is S being thought of as a set of real numbers or as a set of complex 
numbers. A point c is in the interior of a set S of complex numbers if the entire disk B e (c) of radius 
e around c is contained in S. While, a point c belongs to the interior of a set S of real numbers 
if the entire interval (c — e, c + e) is contained in S. Hence, in the following definition, we will be 
careful to distinguish between the cases that / is a function of a real variable or is a function of a 
complex variable. 

4.3 The Derivative of a Function 3 

Now begins what is ordinarily thought of as the first main subject of calculus, the derivative. 

Definition 4.2: 

Let S be a subset of R, let / : S — » C be a complex- valued function (of a real variable) , and let c 
be an element of the interior of S. We say that / is different iable at c if 

lim (4.8 

h^i) h 

exists. (Here, the number h is a real number.) 

Analogously, let S be a subset of C, let / : S — > C be a complex-valued function (of a complex 
variable), and let c be an element of the interior of S. We say that / is different iable at c if 

ujs^zm ( 4. 9) 

exists. (Here, the number h is a complex number.) 



3 This content is available online at <http://cnx.Org/content/m36186/l.2/>. 
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If / : S — » C is a function either of a real variable or a complex variable, and if S' denotes the 
subset of S consisting of the points c where / is differentiable, we define a function /' : S' — » C by 

„> , n f (x + h) — f (x) 
f (x) = lim J y ' J -^-. 4.10 

The function /' is called the derivative of /. 

A continuous function / : [a, b] — » C that is differentiable at each point x € (a, 6) , and whose the 
derivative /' is continuous on (a, 6) , is called a smooth function on [a, b] . If there exists a partition 
{a = xo < x\ < ... < x n = b} of [a, b] such that / is smooth on each subinterval [xj_i,a;i] , then / 
is called piecewise smooth on [a, b] . 

Higher order derivatives are defined inductively. That is, /' is the derivative of /', and so on. 
We use the symbol /(") for the nth derivative of /. 

4.2: 

REMARK In the definition of the derivative of a function /, we are interested in the limit, as h 
approaches 0, not of / but of the quotient q (h) = h ■ Notice that is not in the domain 

of the function q, but is a limit point of that domain. This is the reason why we had to make 
such a big deal above out of the limit of a function. The function q is often called the differential 
quotient. 

4.3: 

REMARK As mentioned in Section 3.1, we are often interested in solving for unknowns that 
are functions. The most common such problem is to solve a differential equation. In such a 
problem, there is an unknown function for which there is some kind of relationship between it 
and its derivatives. Differential equations can be extremely complicated, and many are unsolvable. 
However, we will have to consider certain relatively simple ones in this chapter, e.g., /' = /,/' = — /, 
and /'' = ±f. 

There are various equivalent ways to formulate the definition of differentiable, and each of these 
ways has its advantages. The next theorem presents one of those alternative ways. 

Theorem 4.2: 

Let c belong to the interior of a set S (either in R or in C), and let / : S — > C be a function. Then 
the following are equivalent. 



1. /is differentiable at c. That is, 



Um f{c+h) - f{c) exists. (4.11) 



UmlM li^- exists. (4.12) 

x^c X — C 

3. There exists a number L and a function 9 such that the following two conditions hold: 

f(c+h)-f{c) = Lh + 9{h) (4.13) 

and 

9(h) , 

lim——^=Q. 4.14 

h^{) h 

In this case, L is unique and equals /' (c) , and the function 9 is unique and equals / (c + h) — 
f(c)-f(c)h. 
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Proof: 

That (1) and (2) are equivalent follows from by writing x as c + h. 
Suppose next that / is differentiable at c, and define 



Set 

Then clearly 

which is (4.13). Also 



f'(c) = Um f{C+h) - f{c) . (4.15) 



>(h) = f(c+h)-f(c)-f'(c)h. (4.16) 

f(c + h)-f(c) = Lh + 9(h), (4.17) 



0(h), _ 


,f(c+h)-f(c)-f(c)h 


h 1 


- \^ +h \' m /'(c) 



(4.18) 

which tends to as h approaches because / is differentiable at c. Hence, we have established 
(4.13) and (4.14), showing that (1) implies (3). 

Finally, suppose there is a number L and a function 9 satisfying (4.13) and (4.14). Then 

f(c + h)-f(c )=L+ ew 

h h 

which converges to L as h approaches by (4.14) and part (2) of Theorem 4.1, p. 82. Hence, 
L = f (c) , and so 9 (h) = f (c + h) — f (c) — /' (c) h. Therefore, (3) implies (1), and the theorem is 
proved. 

4.4: 

REMARK Though it seems artificial and awkward, Condition (3) of this theorem is very conve- 
nient for many proofs. One should remember it. 

Exercise 4.6 

a. What is the domain of the function 9 of condition (3) in the preceding theorem? Is in this 
domain? Are there any points in the interior of this domain? 

b. Let L and 9 be as in part (3) of the preceding theorem. Prove that, given an e > there 
exists a S > such that if \h\ < 5 then \9 (h) | < e\h\. 

Theorem 4.3: 

If / : S — > C is a function, either of a real variable or a complex variable, and if / is differentiable 
at a point c of S, then / is continuous at c. That is, differentiability implies continuity. 
Proof: 

We are assuming that lirrih-*o (f(c+h) — f (c)) /h = L. Hence, there exists a positive number 5q 
such that | /(c+/0-/(c) _ L | < 1 if ^ < ^ implymg that |j ( c + fy _ f ( c ) | <• |/j| (|L| + l)whenever 
\h\ < So. So, if e > is given, let 8 be the minimum of So and e/ (\L\ + 1) . If y G S and \y — c\ < S, 
then, thinking of y as being c + h, 

1/(1/) -f(c)\ = \f(c+h)-f(c)\< \h\ (\L\ + 1) = \y-c\ (\L\ + 1) < e. (4.20) 

(Every y can be written as c + h for some h, and \y — c\ = \h\.) 

Exercise 4.7 

Define / (z) = \z\ for z £ C. 
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a. Prove that / is continuous at every point of C. 

b. Show that, if / is differentiable at a point c, then /' (c) = 0. HINT: Using part (b) of 
Theorem 4.1, p. 82, evaluate /' (c) in the following two ways. 

/ ( c ) = Um |c+ " i l ~ |c| (4.21) 



and 



f(c)= lim lc+ ' n j |C| . (4.22) 

n — >oo — 



Show that the only way these two limits can be equal is for them to be 0. 

c. Conclude that / is not differentiable anywhere. Indeed, if it were, what would the function 8 
have to be, and why wouldn't it satisfy (4.14)? 

d. Suppose / : R — » R is the function of a real variable that is defined by / (x) = |x|. Show that 
/ is differentiable at every point x ^ 0. How does this result not contradict part (c)? 

The following theorem generalizes the preceding exercise. 

Theorem 4.4: 

Suppose / : S — > R is a real-valued function of a complex variable, and assume that / is differen- 
tiable at a point c € S. Then /' (c) = 0. That is, every real-valued, differentiable function / of a 
complex variable satisfies /' (c) = for all c in the domain of /'. 
Proof: 

We compute / (c) in two ways. 

/ (c) = lim —-, is a real number.. (4.23) 

n — 

n 

f (c) = lim —. is a purely imaginary number. (4.24) 

n — 

n 

Hence, /'(c) must be 0, as claimed. 

4.5: 

REMARK This theorem may come as a surprise, for it shows that there are very few real- valued 
differentiable functions of a complex variable. For this reason, whenever / : S — > i? is a real- valued, 
differentiable function, we will presume that / is a function of a real variable; i.e., that the domain 
SCR. 

Evaluating Urrih-^oQ CO m the two different ways, h real, and h pure imaginary, led to the proof 
of the last theorem. It also leads us to make definitions of what are called "partial derivatives" of 
real- valued functions whose domains are subsets of C = R 2 . As the next exercise will show, the 
theory of partial derivatives of real- valued functions is a much richer theory than that of standard 
derivatives of real- valued functions of a single complex variable. 

Definition 4.3: 

Let / : S — > R be defined on a set S C C = R 2 , and let c = (a, b) = + + bi be a point in the 
interior of S. We define the partial derivative of f with respect to x at the point c = (a, b) by the 
formula 

Half. f(a+h,b)-f(a,b) 

—(a,b)=hm , (4.25) 

tialx v ' h^a h v ; 
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and the partial derivative of f with respect to y at c = (a, b) by the formula 

whenever these limits exist. (In both these limits, the variable /i is a real variable.) ( 

It is clear that the partial derivatives of a function arise when we fix either the real part of the 
variable or the imaginary part of the variable to be a constant, and then consider the resulting 
function of the other (real) variable. We will see in Exercise 4.8 that there is a definite difference 
between a function's being differentiable at a point c = (a+ bi) in the complex plane C versus its 
having partial derivatives at the point (a, 6) in R 2 . 

Exercise 4.8 

a. Suppose / is a complex-valued function of a complex variable, and assume that both the real 
and imaginary parts of / are differentiable at a point c. Show that / is differentiable at c and 
that /' (c) = 0. 

b. Let / = u + iv be a complex- valued function of a complex variable that is differentiable at a 
point c. Prove that both partial derivatives of u and v exist at c = (a, b) , and in fact that 

tialu , . tialv ,,„>,, ,.„„■, 

T^(c)+i-—(c) = f(c) (4.27) 

tialx tialx 

and 

tialu , , .tialv ,..,.,. , , „„. 

7^(c) + *7^(c) = */ (c . (4.28) 

tialy tialy 

c. Define a complex-valued function / on C = R 2 by / (z) = f (x + iy) = x—iy. Write / = u+iv, 
and show that both partial derivatives of u and v exist at every point, but that / is not a 
differentiable function of the complex variable z. 

The next theorem is, in part, what we call in calculus the "differentiation formulas." 

Theorem 4.5: 

Let / and g be functions (either of a real variable or a complex variable), which are both differen- 
tiable at a point c. Let a and b be complex numbers. Then: 

1. af + bg is differentiable at c, and (af + bg) (c) = af (c) + bg (c) . 

2. (Product Formula) fg is differentiable at c, and (fg) (c) = /' (c) g (c) + / (c) g (c) . 

3. (Quotient Formula) f/g is differentiable at c (providing that g (c) ^ 0), and 

L)\ c) = 9(Q)f'(c)-f(c)g(c) 
9J (9(c)) 2 

If / = u + iv is a complex- valued function, then / is differentiable at a point c if and only if 
u and v are differentiable at c, and /' (c) = u (c) + iv (c) . 
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Proof: 

We prove part (2) and leave parts (1), (3), and (4) for the exercises. We have 

(fg)(c+h)-(fg)(c) _ ,. f(c+h)g(c+h)-f(c)g(c) 



lim U9l ^'T um> = Um 



h->0 h /i->0 



lim 



f(c+h)g(c+h)-f(c)g(c+h) 



h^Q h 



lim 



f{c)g{c+h)-f(c)g(c) 



h^O h 



(4.30) 



= lim— — I limn (c+ h) 
+ Umf{c)Um 9{c+h) - 9{c) 
f'(c)g(c) + f(c)g(c), 
where we have used Theorem 4.1, p. 82, Theorem 4.2, p. 84, and Theorem 4.3, p. 85. 

Exercise 4.9 

a. Prove parts (1), (3), and (4) of Theorem 4.5, p. 87. 

b. If / and g are real- valued functions that are differentiable at a point c, what can be said about 
the differentiability of max (/, g)l 

c. Let / be a constant function / (z) = k. Prove that / is differentiable everywhere and that 
/' (z) = for all z. 

d. Define a function / by / (z) = z. Prove that / is differentiable everywhere and that /' (z) = 1 
for all z. 

e. Verify the usual derivative formulas for polynomial functions: If p(z) = J22=o akzk > then 
V (z) = ELi^fc 2 ^ 1 - 

What about power series functions? Are they differentiable functions? If so, are their derivatives again 
power series functions? In fact, everything works as expected. 

Theorem 4.6: 

Let / be a power series function / (z) = J2^Lo a « 2; ™ having radius of convergence r > 0. Then / 
is differentiable at each point z in its open disk B r (0) of convergence, and 

oo oo 

f{z) = Y j na n z n - 1 = Y,na n z n -\ (4.31) 

n— n—1 

Proof: 

The proof will use part (3) of Theorem 4.2, p. 84. Fix an z with \z\ < r. Choose r so that 
\z\ < r < r, and write a for r — \z\, i.e., \z\ +a = r . Note first that the infinite series X^^Lo l a «l r ' 
converges to a positive number we will call M. Also, from the Cauchy-Hadamard Formula, we know 
that the power series function Yl na n w n has the same radius of convergence as does /, and hence 
the infinite series Y2,na n z n ~ 1 converges to a number we will denote by L. We define a function 8 
by 9 (h) = f (z + h) — f (z) — Lh from which it follows immediately that 

f(z + h)-f(z)=Lh + 0(h), (4.32) 

which establishes (4.13). To complete the proof that / is differentiable at z, it will suffice to 
establish (4.14), i.e., to show that 

9(h) , 

lim^ r - L =0. 4.33 

fe—o h 
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That is, given e > we must show that there exists a 5 > such that if < \h\ < S then 

\0( h) / h \ = \ f{Z + h l- f{z) -L\<e. (4.34) 

Assuming, without loss of generality, that \h\ < a, we have that 

I h 

iE~o°n(E; = o(;)»"-*h fc )-E^ =0 °n»" 



/(z+ft)-/(z) ^i _ | Er=o a "( z + fe )"-Er=o a » zT ' M 



ft 

I ft 

,i:~ian(E2_i(;)*"-*'»*) 



L\ 



E~ i «« (ELi (2) a"-***- 1 ) - E~ i "a.- : 



ft 

n— 1| 



= I E~ i an (ELi (2) ^*fc*- 1 ) - E~ i ( ") a**- 1 ! 
IE~ 2 on(EL 2 (2)^-^ fe - 1 )l 

< E^EIUKKDNrW -1 

< I^IE~2lonlEL 2 (i:)N n " fc l'*l fc " 2 



(4.35) 



in— /ci ifc — 2 

z\ \a\ 



< IME~2|On|EL 2 (2) 

< I^I^E~oKIELo(fc)l^rV 

|/»lsrE~ |On|(N+a) n 

W^E~=oM r '" 

so that if <5 = e/-^-, then |0 (/i) //i| < e, whenever \h\ < S, as desired. 

4.6: 

REMARK Theorem 4.6, p. 88 shows that indeed power series functions are differentiable, and 
in fact their derivatives can be computed, just like polynomials, by differentiating term by term. 
This is certainly a result we would have hoped was true, but the proof is not trivial. 

The next theorem, the Chain Rule, is another nontrivial one. It deals with the differentiability of the 
composition of two differentiable functions. Again, the result is what we would have wanted, the composition 
of two differentiable functions is itself differentiable, but the argument required to prove it is tricky. 

Theorem 4.7: Chain Rule 

Let / : S — > C be a function, and assume that / is differentiable at a point c. Suppose g : T — » C 
is a function, that T C C, that the number / (c) e T, and that g is differentiable at / (c) . Then the 
composition g o / is differentiable at c and 

(9°f)'(c)=g'(f(c))f'(c). (4.36) 

Proof: 

Using part (3) of Theorem 4.2, p. 84, write 

g (/ (c) + k)-g(f (c)) = L g k + g (k) (4.37) 

and 

f(c + h)-f(c)=L f h + Of(h). (4.38) 
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We know from that theorem that L g = g (/ (c)) and Lf = /' (c) . And, we also know that 

9„(k) , 9t(h) 

lim-^- =0 and lim J -± J - = 0. 4.39 

fc^o fc fc->o h 

Define a function k (h) = f(c+h) — f(c). Then, by Theorem 4.3, p. 85, we have that 
lirrifi^ok (h) = 0. We will show that g o / is differentiable at c by showing that there exists a 
number L and a function 9 satisfying the two conditions of part (3) of Theorem 4.2, p. 84. Thus, 
we have that 

gof(c+h)-gof(c) =g(f(c+h))-g(f(c)) 
9(f(c) + k(h))-g(f(c)) 

L g k(h) + e g (k(h)) 

(4.40) 

L g (f(c + h)-f(c)) + e g (k(h)) 

L g (L f h+9 f (h)) + 9 g (k(h)) 

L g L f h + L g e f (h) + e g (k(h)). 

We define L = L g lf = g (/ (c)) /' (c) , and we define the function 9 by 

9(h)=L g 9 f (h)+9 g (k(h)). (4.41) 

By our definitions, we have established (4.13) 

gof(c+h)-gof(c) = Lh+d(h), (4.42) 

so that it remains to verify (4.14). 

We must show that, given e > 0, there exists a S > such that if < \h\ < 5 then \9 (h) /h\ < e. 
First, choose an e > so that 

\L g \e +\L f \e +e' 2 <e (4.3). (4.43) 

Next, using part (b) of Exercise 4.6, choose a 5' > such that if |fc| < 5' then \9 g (k) \ < e'\k\. 
Finally, choose 6 > so that if < \h\ < S, then the following two inequalities hold. \k (h) | < d' 
and \9j (h) \ < e'\h\. The first can be satisfied because / is continuous at c, and the second is a 
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consequence of part (b) of Exercise 4.6. Then: if < \h\ < S, 



\9(h)\ = \L g 6 f (h) + 9 g (k(h)) 
< 



\L g \\e f (h)\ + \e g (k(h))\ 

\L g \e\h\ + e\k(h)\ 

\L g \e'\h\+s'\f(c+h)-f(c)\ 
\L g \e'\h\+e'\L f h + e f (h)\ 

\L g \e'\h\+s'\L f \\h\+e'\9 f (h)\ 

\L g \e'\h\+e'\L f \\h\ + e'e'\h\ 

(\L g \s' + \L f \e' + e' 2 )\h\, 



< 



< 



< 



whence 



as desired. 



(4.44) 



\6 (h) /h\ < (\L g \e + \L f \e + e' 2 ) < s, (4.45) 



Exercise 4.10 

a. Derive the familiar formulas for the derivatives of the elementary transcendental functions: 

exp = exp, sin = cos, ,sinh = cosh, cosh = sinh&nd cos = —sin. (4.46) 

b. Define a function / as follows. / (z) = cos 2 (z) + sin 2 (z) . Use part (a) and the Chain Rule to 
show that /' (z) = for all z € C. Does this imply that cos 2 (z) + sin 2 (z) = 1 for all complex 
numbers zl 

c. Suppose / is expandable in a Taylor series around the point c :f (z) = J2"^Lo a n{ z — c) n for 
all z € B r (c) . Prove that / is differentiable at each point of the open disk B r (c) , and show 
that 

oo 

f{z) = Y j na n {z-c) n -\ (4.47) 

71=1 

HINT: Use Theorem 4.6, p. 88 and the chain rule. 
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4.4 Consequences of Differentiability, the Mean Value Theorem 4 

Definition 4.4: 

Let / : S — > R be a real- valued function of a real variable, and let c be an element of the interior of 
S. Then / is said to attain a local maximum at c if there exists a 5 > such that (c — 5, c + 5) C S 
and / (c) > / (x) for all a; € (c — 5, c + J) . 

The function / is said to attain a local minimum at c if there exists an interval (c — 5, c + 5) C S 
such that / (c) < / (x) for all x € (c — 6, c + 6) . 
The next theorem should be a familiar result from calculus. 

Theorem 4.8: First Derivative Test for Extreme Values 

Let / : S — > R be a real- valued function of a real variable, and let c € S be a point at which / 
attains a local maximum or a local minimum. If / is differentiable at c, then /' (c) must be 0. 
Proof: 

We prove the theorem when / attains a local maximum at c. The proof for the case when / attains 
a local minimum is completely analogous. 

Thus, let 5 > be such that / (c) > / (x) for all x such that \x — c\ < S. Note that, if n is 
sufficiently large, then both c+ - and c— - belong to the interval (c — S,c + 6) . We evaluate /' (c) 



in two ways. First, 



/' (c) = lim f ( C+ n ' f -^L < (4.48) 



because the numerator is always nonpositive and the denominator is always positive. On the other 
hand, 

f (c- ±) - f(c) 
f (c) = lim-±- — ^ — -^ > (4.49) 

n — - 

n 

since both numerator and denominator are nonpositive. Therefore, /' (c) must be 0, as desired. 

Of course we do not need a result like Theorem 4.8, First Derivative Test for Extreme Values, p. 92 
for functions of a complex variable, since the derivative of every real- valued function of a complex variable 
necessarily is 0, independent of whether or not the function attains an extreme value. 

4.7: 

REMARK As mentioned earlier, the zeroes of a function are often important numbers. The 
preceding theorem shows that the zeroes of the derivative /' of a function / are intimately related 
to finding the extreme values of the function /. The zeroes of /' are often called the critical points 
for /. Part (a) of the Exercise 4.11 establishes the familiar procedure from calculus for determining 
the maximum and minimum of a continuous real-valued function on a closed interval. 

Exercise 4.11 

a. Let / be a continuous real- valued function on a closed interval [o, b] , and assume that / is 
differentiable at each point x in the open interval (a, b) . Let M be the maximum value of 
/ on this interval, and m be its minimum value on this interval. Write S for the set of all 
x g (a, b) for which /' (x) = 0. Suppose a: is a point of [a, b] for which / (x) is either M or m. 
Prove that x either is an element of the set S, or x is one of the endpoints a or b. 

b. Let / be the function defined on [0,1/2) by f (t) = i/(l - t) . Show that / (t) < 1 for all 

te [0,1/2). 

c. Let t € (—1/2, 1/2) be given. Prove that there exists an r < 1, depending on t, such that 
\t/ (1 + y) | < r for all y between and t. 



4 This content is available online at <http://cnx.Org/content/m36203/l.2/>. 
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d. Let t be a fixed number for which < t < 1. Show that, for all < s < t,(t — s) j (1 + s) < t. 

Probably the most powerful theorem about differentiation is the next one. It is stated as an equation, but 
its power is usually as an inequality; i.e., the absolute value of the left hand side is less than or equal to the 
absolute value of the right hand side. 

Theorem 4.9: Mean Value Theorem 

Let / be a real- valued continuous function on a closed bounded interval [a, b] , and assume that / 
is differentiable at each point x in the open interval (a, b) . Then there exists a point c € (a, b) such 
that 

/(&)-/ (a) = /»(&- a). (4.50) 

Proof: 

This proof is tricky. Define a function h on [a, b] by 

h (x) = x (/ (b) - f (a)) - / (x) (b-a). (4.51) 

Clearly, h is continuous on [a, b] and is differentiable at each point x € (a, b) . Indeed, 

ti (x) = f(b)-f (a) - f (x) (b-a). (4.52) 

It follows from this equation that the theorem will be proved if we can show that there exists a 
point c € (a, b) for which K (c) = 0. Note also that 

h (a) = a (f (b) - f (a)) - f (a) (b - a) = af (b) - bf (a) (4.53) 

and 

h (b) = b (f (b) - f (a)) - f (b) (b-a) = af (b) - bf (a) , (4.54) 

showing that h(a) = h (b) . 

Let m be the minimum value attained by the continuous function h on the compact interval 
[a, b] and let M be the maximum value attained by h on [a, b] . If m = M, then h is a constant on 
[a, b] and h' (c) = for all c € (a, b) . Hence, the theorem is true if M = m, and we could use any 
c g (a, b) . If m / M, then at least one of these two extreme values is not equal to h (a) . Suppose 
m y^ h (o) . Of course, m is also not equal to h (b) . Let c € [a, b] be such that h (c) = m. Then, in 
fact, c g (a, b) . By Theorem 4.8, First Derivative Test for Extreme Values, p. 92, h' (c) = 0. 

We have then that in every case there exists a point c s (a, b) for which h' (c) = 0. This completes 
the proof. 

4.8: 

REMARK The Mean Value Theorem is a theorem about real- valued functions of a real variable, 
and we will see later that it fails for complex-valued functions of a complex variable. (See part (f) 
of Exercise 4.16.) In fact, it can fail for a complex-valued function of a real variable. Indeed, if 
/ (x) = u (x)+iv (x) is a continuous complex- valued function on the interval [a, b] , and differentiable 
on the open interval (a, b) , then the Mean Value Theorem certainly holds for the two real- valued 
functions u and v, so that we would have 

f{b)-f (a) = u (b) - u (a) + i (v (6) - v (a)) = u (ci) (6 - a) + iv (c 2 ) {b-a), (4.55) 

which is not /' (c) (b — a) unless we can be sure that the two points c\ and c^ can be chosen to be 
equal. This simply is not always possible. Look at the function / (x) = x 2 

[0,1]. 
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On the other hand, if / is a real- valued function of a complex variable (two real variables), then 
a generalized version of the Mean Value Theorem does hold. See part (c) of Exercise 4.35. 

One of the first applications of the Mean Value Theorem is to show that a function whose 
derivative is identically is necessarily a constant function. This seemingly obvious fact is just not 
obvious. The next exercise shows that this result holds for complex- valued functions of a complex 
variable, even though the Mean Value Theorem does not. 

Exercise 4.12 

a. Suppose / is a continuous real- valued function on (a, 6) and that /' (x) = for all x g (a, b) . 
Prove that / is a constant function on (a, b) . HINT: Show that / (x) = f (a) for all x g [a, b] 
by using the Mean Value Theorem applied to the interval [a, x] . 

b. Let / be a complex-valued function of a real variable. Suppose / is different iable at each 
point x in an open interval (a, 6) , and assume that /' (a;) = for all x g (a, b) . Prove that / 
is a constant function. HINT: Use the real and imaginary parts of /. 

c. Let / be a complex- valued function of a complex variable, and suppose that / is differentiable 
on a disk B r (c) C C, and that /' (z) = for all z g B r (c) . Prove that / (z) is constant on 
B r (c) . HINT: Let z be an arbitrary point in B r (c) , and define a function h : [0, 1] — ► C by 
h (t) = f ((1 - t) c + tz) . Apply part (b) to h. 

The next exercise establishes, at last, two important identities. 

Exercise 4.13 

(cos 2 + sin = 1 and exp(iir = — 1.) 

a. Prove that cos 2 (z) + sin (z) = 1 for all complex numbers z. 

b. Prove that cos(tt) = — 1. HINT: We know from part (a) that cos (n) = ±1. Using the Mean 
Value Theorem for the cosine function on the interval [0,7r] , derive a contradiction from the 
assumption that cos (it) = 1. 

c. Prove that exp(iir) = — 1. HINT: Recall that exp(iz) = cos (z) + isin(z) for all complex z. 
(Note that this does not yet tell us that e t7r = — 1. We do not yet know that exp(z) = e z .) 

d. Prove that cosh z — sinh z = 1 for all complex numbers z. 

e. Compute the derivatives of the tangent and hyperbolic tangent functions tan = sin/ cos and 
tanh = sinh/ cosh. Show in fact that 

tan = and tanh = k. (4.56) 

cos z cosh 

Here are two more elementary consequences of the Mean Value Theorem. 
Exercise 4.14 

a. Suppose / and g are two complex- valued functions of a real (or complex) variable, and suppose 
that /' (x) = g (x) for all x s (a, b) (or x s B r (c) .) Prove that there exists a constant k such 
that / (x) = g (x) + k for all x s (a, b) (or x s B r (c) .) 

b. Suppose / (z) = cexp(az) for all z, where c and a are complex constants with a/0. Prove 
that there exists a constant c such that / (z) = -exp (az) + c . What if a = 0? 

c. (A generalization of part (a)) Suppose / and g are continuous real- valued functions on the 
closed interval [a, b] , and suppose there exists a partition {xo < x\ < ... < x n } of [a, b] 
such that both / and g are differentiable on each subinterval (x,_i,Xj) . (That is, we do not 
assume that / and g are differentiable at the endpoints.) Suppose that /' (x) = g (x) for 
every x in each open subinterval (x,_i,Xj) . Prove that there exists a constant k such that 
/ (x) = g (x) + k for all x s [a, b] . HINT: Use part (a) to conclude that / = g + h where h is 
a step function, and then observe that h must be continuous and hence a constant. 

d. Suppose / is a differentiable real- valued function on (a, b) and assume that /' (x) / for all 
x g (o, b) . Prove that / is 1-1 on (a, b) . 
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Exercise 4.15 

Let / : [a, b] — » R be a function that is continuous on its domain [a, b] and differentiable on (a, b) . 
(We do not suppose that /' is continuous on (a, b) .) 

a. Prove that / is nondecreasing on [a,b] if and only if /' (x) > for all x € (a, b) . Show also 
that / is nonincreasing on [a, b] if and only if /' (x) < for all x € (a, b) . 

b. Conclude that, if /' takes on both positive and negative values on (o, b) , then / is not 1-1. 
(See the proof of Theorem 3.11, p. 67.) 

c. Show that, if /' takes on both positive and negative values on (o, b) , then there must exist 
a point c G (a, b) for which /' (c) = 0. (If /' were continuous, this would follow from the 
Intermediate Value Theorem. But, we are not assuming here that /' is continuous.) 

d. Prove the Intermediate Value Theorem for Derivatives: Suppose / is continuous on the closed 
bounded interval [a, b] and differentiable on the open interval (a, b) . If /' attains two distinct 
values vi = f (x\) < V2 = / (#2) , then /' attains every value v between v\ and v-i- HINT: 
Suppose v is a value between v\ and vi- Define a function g on [a, b] by g (x) = f (x) — vx. 
Now apply part (c) to g. 

Here is another perfectly reasonable and expected theorem, but one whose proof is tough. 

Theorem 4.10: Inverse Function Theorem 

Suppose / : (a, b) — > R is a function that is continuous and 1-1 from (a, b) onto the interval (a', b') . 
Assume that / is differentiable at a point c € (a, b) and that /' (c) / 0. Then / _1 is differentiable 
at the point / (c) , and 

f- 1 '(f(c)) = j^. (4.57) 

Proof: 

The formula / _1 (/(c)) = 1//' (c) is no surprise. This follows directly from the Chain Rule. 
For, if / _1 (/ (x)) = x, and / and / _1 are both differentiable, then / _1 (/ (c)) /' (c) = 1, which 
gives the formula. The difficulty with this theorem is in proving that the inverse function / _1 of 
/ is differentiable at / (c) . In fact, the first thing to check is that the point / (c) belongs to the 
interior of the domain of / _1 , for that is essential if / _1 is to be differentiable there, and here is 
where the hypothesis that / is a real- valued function of a real variable is important. According to 
Exercise 3.12, the 1-1 continuous function / maps [a, b] onto an interval [a', b'~\ , and / (c) is in the 
open interval (a , 6) , i.e., is in the interior of the domain of / _1 . 

According to part (2) of Theorem 4.2, p. 84, we can prove that / _1 is differentiable at / (c) by 
showing that 

Um f-H*)-f-Hf(c)) = ^_ 

x-/(c) a; -/(c) /'(c) 

That is, we need to show that, given an e > 0, there exists a S > such that if < \x — f (c)\ < S 
then 

,/ _1 (x) - r 1 (/ '(c)) 1 , 

' Jm ~m x< '- (459) 

First of all, because the function 1/q is continuous at the point /' (c) , there exists an e > such 
that if \q — f (c) | < e , then 



96 CHAPTER 4. DIFFERENTIATION, LOCAL BEHAVIOR 

Next, because / is differentiable at c, there exists a 5' > such that if < \y — c\ < 6' then 

\ f{y) - f{c) -f'(c)\<e'. (4.61) 

y-c 

Now, by Theorem 3.10, p. 66, / _1 is continuous at the point /(c) , and therefore there exists a 
S > such that if \x — / (c) | < 5 then 

ir i (x)-r i (/( C )i< ( 5'(4.62) 

So, if \x - f (c) | < S, then 

I/- 1 (a:) - c\ = \r l (x) - r 1 (/ (c)) \<S\ (4.63) 

But then, by (4.61), 

i 7 TO : e /(e) -/•(■*!<■•■ <«*> 

from which it follows, using (4.60), that 

J" 1 (a)-/" 1 (/(c)) 1 



a: -/(c) /'(c) 

as desired. 



< e, (4.65) 



4.9: 

REMARK A result very like Theorem 4.10, Inverse Function Theorem, p. 95 is actually true for 
complex-valued functions of a complex variable. We will have to show that if c is in the interior of 
the domain S of a one-to-one, continuously differentiable, complex- valued function / of a complex 
variable, then / (c) is in the interior of the domain / (S) of / _1 . But, in the complex variable case, 
this requires a somewhat more difficult argument. Once that fact is established, the proof that 
Z" 1 is differentiable at / (c) will be the same for complex- valued functions of complex variables as 
it is here for real- valued functions of a real variable. Though the proof of Theorem 4.10, Inverse 
Function Theorem, p. 95 is reasonably complicated for real-valued functions of a real variable, 
the corresponding result for complex functions is much more deep, and that proof will have to be 
postponed to a later chapter. See Theorem 7.10, Open Mapping Theorem, p. 198. 

4.5 The Exponential and Logarithm Functions 5 

We derive next the elementary properties of the exponential and logarithmic functions. Of course, by 
"exponential function," we mean the power series function exp. And, as yet, we have not even defined a 
logarithm function. 

Exercise 4.16 

a. Define a complex-valued function / : C — > C by / (z) = exp (z) exp {—z) . Prove that / (z) = 1 
for all z e C. 

b. Conclude from part (a) that the exponential function is never 0, and that exp(-z) = 
1 J exp (z) . 

c. Show that the exponential function is always positive on R, and that lirrix^-ooexp (x) = 0. 

d. Prove that exp is continuous and 1-1 from (— oo, oo) onto (0, oo) . 

e. Show that the exponential function is not 1-1 on C. 



5 This content is available online at <http://cnx.Org/content/m36199/l.2/>. 
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f. Use parts b and e to show that the Mean Value Theorem is not in any way valid for complex- 
valued functions of a complex variable. 

Using part (d) of the preceding exercise, we make the following important definition. 

Definition 4.5: 

We call the inverse exp~ l of the restriction of the exponential function to R the (natural) logarithm 
function, and we denote this function by In. 

The properties of the exponential and logarithm functions are strongly tied to the simplest kinds of 
differential equations. The connection is suggested by the fact, we have already observed, that exp = exp. 
The next theorem, corollary, and exercises make these remarks more precise. 

Theorem 4.11: 

Suppose / : C — > C is differentiable everywhere and satisfies the differential equation /' = af, 
where a is a complex number. Then / (z) = cexp (az) , where c = / (0) . 
Proof: 

Consider the function h(z) = f (z) /exp (az) . Using the Quotient Formula, we have that 

h ' ,, = exp (az) f (z) - aexp (az) f (z) = exp(az) (/' (z) - af (z)) = q 

[exp(az)} [exp (z)] 

Hence, there exists a complex number c such that h(z) = c for all z. Therefore, / (z) = cexp (az) 
for all z. Setting z = gives / (0) = c, as desired. 

Corollary 4.1: Law of Exponents 

For all complex numbers z and w,exp (z + w) = exp (z) exp (w) . 
Proof: 

Fix w, define f (z) = exp(z + w), and apply the preceding theorem. We have /' (z) = 
exp (z + w) = f (z) , so we get 

exp (z + w) = f (z) = f (0) exp (z) = exp (w) exp (z) . (4.67) 



Exercise 4.17 

a. If n is a positive integer and z is any complex number, show that exp(nz) = (exp(z)) . 

b. If r is a rational number and x is any real number, show that exp(rx) = (exp(x)) . 

Exercise 4.18 

a. Show that In is continuous and 1-1 from (0,oo) onto R. 

b. Prove that the logarithm function In is differentiable at each point y s (0, oo) and that 
In (y) = 1/y. HINT: Write y = exp(c) and use Theorem 4.10. 

c. Derive the first law of logarithms: In (xy) = In (x) + In (y) . 

d. Derive the second law of logarithms: That is, if r is a rational number and a; is a positive real 
number, show that In (x r ) = rln (x) . 

We are about to make the connection between the number e and the exponential function. The next theorem 
is the first step. 

Theorem 4.12: 

ln(l) = and In (e) = 1. 
Proof: 

If we write 1 = exp (t) , then t = In (1) . But exp (0) = 1, so that In (1) = 0, which establishes the 
first assertion. 
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Recall that 



e = lim[ 1+ - I . (4.68) 



n 



Therefore, 



ln(e) = In (lim(l + £)") 
= limln((l + ^) n ) 
= liranln (l H — ) 



(4.69) 



(4.70) 



= hm — x 

n n 

,. in(l+i)-in(l) 

= lim — t 

in'(l) 
1/1 
1. 
This establishes the second assertion of the theorem. 

Exercise 4.19 

a. Prove that 

^ n! 

HINT: Use the fact that the logarithm function is 1-1. 

b. For r a rational number, show that exp(r) = e r . 

c. If a is a positive number and r = p/q is a rational number, show that 

a r = exp (rln (a)) . (4-71) 

d. Prove that e is irrational. HINT: Let p n /q n be the nth partial sum of the series in part 
(a). Show that q n < n\, and that limq n (e — p n /q n ) = 0. Then use Theorem 2.19, Test for 
Irrationality, p. 52. 

We have finally reached a point in our development where we can make sense of raising any positive number 
to an arbitrary complex exponent. Of course this includes raising positive numbers to irrational powers. We 
make our general definition based on part (c) of the preceding exercise. 

Definition 4.6: 

For a a positive real number and z an arbitrary complex number, define a z by 

a z = exp {zln (a)) . (4.72) 

4.10: 

REMARK The point is that our old understanding of what a r means, where a > and r is a 
rational number, coincides with the function exp (rln (a)) . So, this new definition of a z coincides 
and is consistent with our old definition. And, it now allows us to raies a positive number o to an 
arbitrary complex exponent. 
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4.11: 

REMARK Let the bugles sound!! Now, having made all the appropriate definitions and derived 
all the relevant theorems, we can finally prove that e i7r = — 1. From the definition above, we see 
that if a = e, then we have e z = exp(z) . Then, from part (c) of Exercise 4.13, we have what we 
want: 

e™ = -1. (4.73) 

Exercise 4.20 

a. Prove that, for all complex numbers z and w,e z+w = e z e w . 

b. If x is a real number and z is any complex number, show that 

{e x f = e xz . (4.74) 

c. Let a be a fixed positive number, and define a function / : C — > C by / (z) = a z . Show that 
/ is different iable at every z € C and that /' (z) = In (a) a z . 

d. Prove the general laws of exponents: If o and 6 are positive real numbers and z and w are 
complex numbers, 

(4.75) 



a z+w 


= a z a w 


a z b z 


= (ab) z .. 


a xw -- 


= (a x ) w . 



(4.76) 

and, if x is real, 

(4.77) 

e. If y is a real number, show that \e ly \ = 1. If z = x + iy is a complex number, show that 
\e z \ = e x . 

f. Let a = a + bi be a complex number, and define a function / : (0, oo) — > C by / (x) = x a = 
e aln ( x > . Prove that / is differentiable at each point x of (0, oo) and that /' (x) = ax"' 1 . 

g. Let a = a + bi be as in part (f). For x > 0, show that \x a \ = x a . 



4.6 The Trigonometric and Hyperbolic Functions 6 

The laws of exponents and the algebraic connections between the exponential function and the trigonometric 
and hyperbolic functions, give the following "addition formulas:" 

Theorem 4.13: 

The following identities hold for all complex numbers z and w. 

sin (z + w) = sin (z) cos (w) + cos (z) sin (w) . (4.78) 

cos (z + w) = cos (z) cos (w) — sin (z) sin (w) . (4.79) 

sink (z + w) = sink (z) cosh (w) + cosh (z) sinh (w) . (4.80) 



6 This content is available online at <http://cnx.Org/content/m36196/l.2/>. 
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cosh (z + w) = cosh (z) cosh (w) + sinh (z) sinh (w) . (4-81) 

Proof: 

We derive the first formula and leave the others to an exercise. 
First, for any two real numbers x and y, we have 

cos (x + y) + isin (x + y) = e z ( x +y) 

e ix e iy 

= (cosx + isinx) x (cosy + isiny) 

= cosxcosy — sinxsiny + i (cosxsiny + sinxcosy) , 

which, equating real and imaginary parts, gives that 



(4.82) 



cos (x + y) = cosxcosy — sinxsiny (4.83) 

and 

sin (x + y) = sinxcosy + cosxsiny. (4.84) 

The second of these equations is exactly what we want, but this calculation only shows that it 
holds for real numbers x and y. We can use the Identity Theorem to show that in fact this formula 
holds for all complex numbers z and w. Thus, fix a real number y. Let / (z) = sinzcosy + coszsiny, 
and let 

g (z) = sin < z + y) = - ( e i{z+v) - e~ i(z+v) = - (e iz e ly - e~ iz e~ ly ) .(4.85) 
2i \ 2i 

Then both / and g are power series functions of the variable z. Furthermore, by the previous 
calculation, /(1/fc) = g(l/k) for all positive integers k. Hence, by the Identity Theorem, / (z) = 
g (z) for all complex z. Hence we have the formula we want for all complex numbers z and all real 
numbers y. 

To finish the proof, we do the same trick one more time. Fix a complex number z. Let / (w) = 
sinzcosw + coszsinw, and let 

g (w) = sin (z + w) = - ( e l{z+w ^ - e ~ l( - z+w ^ = - (e lz e lw - e~ iz e~ iw ) .(4.86) 

Again, both / and g are power series functions of the variable w, and they agree on the sequence 
{1/fc}. Hence they agree everywhere, and this completes the proof of the first addition formula. 

Exercise 4.21 

a. Derive the remaining three addition formulas of the preceding theorem. 

b. From the addition formulas, derive the two "half angle" formulas for the trigonometric func- 
tions: 

• 2 / n 1 — cos (2z) . , „„. 

sm 2 (z)= ^-L, (4.87) 



and 

cos 1 (z) = ' """^-"\ (4. 88 ) 



., 1 + cos (2z) 
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Theorem 4.14: 

The trigonometric functions sin and cos are periodic with period 2n; i.e., sin (z + 27r) = sin(z) 
and cos (z + 2tt) = cos (z) for all complex numbers z. 
Proof: 

We have from the preceding exercise that sin (z + 2tt) = sin (z) cos (2tt) + cos (z) sin (2ir) , so 
that the periodicity assertion for the sine function will follow if we show that cos (2tt) = 1 and 
sin (2ir) = 0. From part (b) of the preceding exercise, we have that 

o , , 1 — cos (2ir) 
= sin 2 (tt) = ^ — '- (4.89) 

which shows that cos (2tt) = 1. Since cos 2 + sin 2 = 1, it then follows that sin (2n) = 0. 
The periodicity of the cosine function is proved similarly. 

Exercise 4.22 

a. Prove that the hyperbolic functions sinh and cosh are periodic. What is the period? 

b. Prove that the hyperbolic cosine cosh (x) is never for x a real number, that the hyperbolic 
tangent tanh(x) = sinh (x) /cosh (x) is bounded and increasing from R onto (—1,1), and 

that the inverse hyperbolic tangent has derivative given by tanh^ 1 (y) = 1/ (l — y 2 ) . 

c. Verify that for all y € (— 1, 1) 



tanh- 1 (y) = In \J\^r. ) (4.90) 



y 

Exercise 4.23: Polar coordinates 

Let z be a nonzero complex number. Prove that there exists a unique real number < 6 < 2tt 
such that z = re 10 , where r = \z\. 

HINT: If z = a + bi, then z = r (f + H. Observe that -1 < f < 1,-1 < £ < 1, and 

(r) + ( ) = 1- Sh° w that there exists a unique < 6 < 2ir such that - = cosO and - = sinO. 



4.7 L'Hopital's Rule 7 

Many limits of certain combinations of functions are difficult to evaluate because they lead to what's known 
as "indeterminate forms." These are expressions of the form 0/0,oo/oo,0°,oo — oo,l°°, and the like. They are 
precisely combinations of functions that are not covered by our limit theorems. See Theorem 4.1, p. 82. The 
very definition of the derivative itself is such a case: linih-to (f(c+h) — f (cj) = 0,limh^oh = 0, and we are 
interested in the limit of the quotient of these two functions, which would lead us to the indeterminate form 
0/0. The definition of the number e is another example: lira (1 + 1/n) = l,limn = oo, and we are interested 
in the limit of (1 + l/n) n , which leads to the indeterminate form 1°°. L'Hopital's Rule, Theorem 4.16, p. 
102 below, is our strongest tool for handling such indeterminate forms. 

To begin with, here is a useful generalization of the Mean Value Theorem. 

Theorem 4.15: Cauchy Mean Value Theorem 

Let / and g be continuous real- valued functions on a closed interval [o, b] , suppose g (a) ^ g (b) , 
and assume that both / and g are differentiable on the open interval (a, b) . Then there exists a 
point c g (a, b) such that 

/»)-/(«) /» (491) 

g(b)-g(a) g (c) ' l " > 



7 This content is available online at <http://cnx.Org/content/m36201/l.2/>. 
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Exercise 4.24 

Prove the preceding theorem. 

HINT: Define an auxiliary function h as was done in the proof of the original Mean Value 
Theorem. 

The following theorem and exercise comprise what is called L'Hopital's Rule. 

Theorem 4.16: 

Suppose / and g are differentiable real- valued functions on the bounded open interval (a, b) and 
assume that 

lim 414 = L, (4.92) 

x^a+Qg (x) 

where L is a real number. (Implicit in this hypothesis is that g (x) ^ for x in some interval 
(a, a + a) .) Suppose further that either 

lim f (x) = lim g (x) = (4.93) 

x — >a+0 x — >a+0 



then 



Proof: 

Suppose first that 



lim f (x) = lim g(x) = oo. (4.94) 

x — >a+0 x — >a+0 



f (x) 
lim J -±4 = L. (4.95) 

x^a+0g[x) 



lim f (x) = lim g [x) = 0. (4.96) 

x — >a+0 x — >a+0 



Observe first that, because g (x) / for all x in some interval (a, a + a) ,g' (x) is either always 
positive or always negative on that interval. (This follows from part (d) of Exercise 4.15.) Therefore 
the function g must be strictly monotonic on the interval (a, a + a) . Hence, since lim x -+ a +o9 ( x ) = 
0, we must have that g (x) / on the interval (a, a + a) . 

Now, given an e > 0, choose a positive S < a such that if a < c < a + 5 then 1^44 — L\ < e. 
Then, for every natural number n for which l/n < 5, and every a < x < a + 5, we have by the 
Cauchy Mean Value Theorem that there exists a point c between a+ l/n and x such that 

\ fW-ff + W -L\ = m-L\<e. (4.97) 

1 g (x) - g (a + l/n) ' 5 (c) 

Therefore, taking the limit as n approaches oo, we obtain 

\ [S f\-L\= I™ \ n f ) - f \ a l] , r\ ~L\<e (4.98) 

g (X) n^oo g {x) — g (a + l/n) 

for all x for which a < x < a + S. This proves the theorem in this first case. 
Next, suppose that 

lim f (x) = lim g(x) = oo. (4.99) 

x — >a+0 x — >a+0 

This part of the theorem is a bit more complicated to prove. First, choose a positive a so that 
/ (x) and g (x) are both positive on the interval (a, a + a) . This is possible because both functions 
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are tending to infinity as x approaches a. Now, given an e > 0, choose a positive number (3 < a 
such that 

\ f -Jfl-L\< £ - (4.100) 

9 (c) 2 

for all a < c < a + j3. We express this absolute value inequality as the following pair of ordinary 
inequalities: 

L- £ -<0f\<L+ £ -. (4.101) 

2 g (c) 2 

Set y = a + /3. Using the Cauchy Mean Value Theorem, and the preceding inequalities, we have 
that for all a < x < y 

L -l< fJ rr IJ n< L+£ r ^ 

2 g{x)-g{y) 2 

implying that 

(L - |) (g (x) - g (y)) + f (y) < f (x) < (l + |) (g (x) - g (y)) + f (y) . (4.103) 

Dividing through by g (x) and simplifying we obtain 

L _l_ (^-f9(y) + m < m <L+ s (L + i),(y) + m (4104) 

2 g(x) g{x) g{x) 2 g (x) g (x) 

Finally, using the hypothesis that lim x ^ a +og i x ) = °°j an d the fact that L, e, g (y) , and / (y) are 
all constants, choose a 5 > 0, with 8 < /3, such that if a < x < a + 5, then 



(L-I)g(y) /(„) 



£ 



g(x) g(x) 2 

and 



< o (4.105) 



pr(x) g(x) 2 

Then, for all a < x < a + 5, we would have 



(*+!)'(¥), /(*).<*. (4.106) 



implying that 



and the theorem is proved. 



L-e< ^-f <L + e, (4.107) 



|/(a;) L|<e, (4.108) 



g(x) 



Exercise 4.25 

a. Show that the conclusions of the preceding theorem also hold if we assume that 

f (x) 
Urn Vr4 = oo. (4.109) 

x^a+0g (x) 

HINT: Replace e by a large real number B and show that / (x) /g (x) > B if < x — a < 5. 
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b. Show that the preceding theorem, as well as part (a) of this exercise, also holds if we replace 
the (finite) endpoint a by — oo. HINT: Replace the S's by negative numbers B. 

c. Show that the preceding theorem, as well as parts a and b of this exercise, hold if the limit as 
x approaches a from the right is replaced by the limit as x approaches b from the left. HINT: 
Replace / (a;) by / (-x) and g (x) by g (-x) . 

d. Give an example to show that the converse of L'Hopital's Rule need not hold; i.e., find 
functions / and g for which lim x ^ a +of (x) = lim x ^ a +og (x) = 0, 

f t x ) fix) 

Urn — — — exists, but lira , does not exist. (4.110) 

x^a+ogyx) x^a+og (x) 

e. Deduce from the proof given above that if lim x —> a +of {%) / 9 i x ) = L and lim x ^ a+ og (x) = 
oo, then lim x ^ a+ of {%) Id i x ) = L independent of the behavior of /. 

f. Evaluate lim x ^oo xl ^ X i and Zim x _>o(l — x ) ■ HINT: Take logarithms. 



4.8 Higher Order Derivatives 8 

Definition 4.7: 

Let S be a subset of R (or C), and Let / : S — > C be a function of a real (or complex) variable. We 
say that / is continuously differentiable on S° if / is differentiable at each point x of S° and the 
function /' is continuous on 5°. We say that / e C 1 (S) if / is continuous on S and continuously 
differentiable on S°. We say that / is 2-times continuously differentiable on S° if the first derivative 
/' is itself continuously differentiable on S°. And, inductively, we say that / is k-times continuously 
differentiable on S° if the k— 1st derivative of / is itself continuously differentiable on 5°. We write 
/( fc ) for the fcth derivative of /, and we write / e C k (S) if / is continuous on S and is k times 
continuously differentiable on S°. Of course, if / s C k (S) , then all the derivatives f^\ for j < k, 
exist nd are continuous on S°. (Why?) 

For completeness, we define /(°) to be / itself, and we say that / s C°° (S) if / is continuous 
on S and has infinitely many continuous derivatives on 5°; i.e., all of its derivatives exist and are 
continuous on S . 

As in Section 3.1, we say that / is real-analytic (or complex-analytic) on S if it is expandable 
in a Taylor series around each point c s S° 

4.12: 

REMARK Keep in mind that the definition above, as applied to functions whose domain S is a 
nontrivial subset of C, has to do with functions of a complex variable that are continuously differ- 
entiable on the set S°. We have seen that this is quite different from a function having continuous 
partial derivatives on 5°. We will return to partial derivatives at the end of this chapter. 

Theorem 4.17: 

Let S be an open subset of R (or C). 

1. Suppose WS is a subset of R. Then, for each k > 1, there exists a function in C k (S) that is 
not in C k+1 (5) . That is, C k+1 (S) is a proper subset of C k (5) . 

2. If / is real-analytic (or complex-analytic) on S, then / s C°° (5) . 

3. There exists a function in C°° (R) that is not real-analytic on R. That is, the set of real- 
analytic functions on R is a proper subset of the set C°° (R) . 



s This content is available online at <http://cnx.Org/content/m36192/l.2/>. 
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REMARK Suppose S is an open subset of C. It is a famous result from the Theory of Complex 
Variables that if / is in C l (S) , then / is necessarily complex analytic on S. We will prove this 
amazing result in Theorem 7.5, p. 192. Part (3) of the theorem shows that the situation is quite 
different for real- valued functions of a real variable. 
Proof: 

For part (1), see the exercise below. Part (2) is immediate from part (c) of Exercise 4.10. Before 
finishing the proof of part (3) , we present the following lemma: 

Lemma 4.1: 

Let / be the function defined on all of R as follows. 

V ( XI G~ 

f{x) = {Ox < FV ; x > (4-111) 

where p (x) is a fixed polynomial function and n is a fixed nonnegative integer. Then / is continuous 
at each point x of R. 
Proof: 

The assertion of the lemma is clear if x / 0. To see that / is continuous at 0, it will suffice to prove 
that 

Um p(x)e - * =0. (4.112) 

rr^O+0 X n 

(Why?) But, for x > 0, we know from part (b) of Exercise 3.22 that e 1/x > 1/ (x n+1 (n + 1)!) , 
implying that e^ 1 ^ < x n+1 (n + 1)!. Hence, for x > 0, 

1/ (*) I = HX) X 1/X <{n+ 1)!x|p (x) I' (4 - U3) 

and this tends to as a; approaches from the right, as desired. 

Returning to the proof of Theorem 4.17, p. 104, we verify part (3) by observing that if / is as in 
the preceding lemma then / is actually differentiable, and its derivative /' is a function of the same sort. 
(Why?) It follows that any such function belongs to C°° (R) . On the other hand, a nontrivial such / 
cannot be expandable in a Taylor series around because of the Identity Theorem. (Take Xk = — 1/fc.) This 
completes the proof. 

Exercise 4.26 

a. Prove part (1) of Theorem 4.17, p. 104. Use functions of the form x n sin(l/x) . 

b. Prove that any function of the form of the / in the lemma above is everywhere differentiable 
on R, and its derivative has the same form. Conclude that any such function belongs to 
C°° (R) . 

c. For each positive integer n, define a function /„ on the interval (—1, 1 by /„ (x) = \x\ . 
Prove that each /„ is differentiable at every point in (— 1, 1) , including 0. Prove also that the 
sequence {/ n } converges uniformly to the function / (x) = \x\. (See part (h) of Exercise 3.28.) 
Conclude that the uniform limit of differentiable functions of a real variable need not be 
differentiable. (Again, for functions of a complex variable, the situation is very different. In 
that case, the uniform limit of differentiable functions is differentiable. See Theorem 7.11, p. 
199.) 

Exercise 4.27: A smooth approximation to a step function. 

Suppose a < b < c < d are real numbers. Show that there exists a function \ m C°° (R) such that 
< X i x ) 5= 1 f° r a ll X -X ( x ) — 1 f° r x e [&i c ] i an d X i x ) — for x $. (a, d) . (If o is close to b and c 
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is close to d, then this function is a C°° approximation to the step function that is 1 on the interval 
[b, c] and elsewhere.) 

a. Let / be a function like the one in the lemma. Think about the graphs of the functions 
/ (x — c) and / (b — x) . Construct a C°° function g that is between b and c and positive 
everywhere else. 

b. Construct a C°° function h that is positive between a and d and everywhere else. 

c. Let g and h be as in parts (a) and (b). If j = g + h, show that j is never 0, and write k for 
the C°° function k = 1/j. 

d. Examine the function hk, and show that it is the desired function \. 

Theorem 4.18: Formula for the coefficients of a Taylor Series function 
Let / be expandable in a Taylor series around a point c : 

f(x)=J2a n (x-c) n . (4.114) 

Then for each n,a n = /(") (c) jn\. 
Proof: 

Because each derivative of a Taylor series function is again a Taylor series function, and because 
the value of a Taylor series function at the point c is equal to its constant term cto, we have that 
a\ = f (c) . Computing the derivative of the derivative, we see that 2a2 = / (c) = f^ (c) . 
Continuing this, i.e., arguing by induction, we find that n\a n = /(") (c) , which proves the theorem. 



4.9 Taylor Polynomials and Taylor's Remainder Theorem 9 

Definition 4.8: 

Let / be in C n (B r (c)) for c a fixed complex number, r > 0, and n a positive integer. Define the 
Taylor polynomial of degree n for / at c to be the polynomial T n = TJi ^ given by the formula: 

n 

( T (/,c)) W = I>^ - c )'< (4-115) 

j=o 

where aj = f^ (c) /j\. 

4.13: 

REMARK If / is expandable in a Taylor series on B r (c) , then the Taylor polynomial for / of 
degree n is nothing but the nth partial sum of the Taylor series for / on B r (c) . However, any 
function that is n times differentiable at a point c has a Taylor polynomial of order n. Functions 
that are infinitely differentiable have Taylor polynomials of all orders, and we might suspect that 
these polynomials are some kind of good approximation to the function itself. 

Exercise 4.28 

Prove that / is expandable in a Taylor series function around a point c (with radius of convergence 
r > 0) if and only if the sequence {TJi A of Taylor polynomials converges pointwise to /; i.e., 

f(z) = lim(T? M )(z) (4.116) 

for all z in B r (c). 
Exercise 4.29 

Let f eC n (B r (c)) . Prove that /' £ C^ 1 (B r (c)) . Prove also that (Tfi c) ) = T™7^. 



9 This content is available online at <http://cnx.Org/content/m36204/l.2/>. 
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The next theorem is, in many ways, the fundamental theorem of numerical analysis. It clearly 
has to do with approximating a general function by polynomials. It is a generalization of the Mean 
Value Theorem, and as in that case this theorem holds only for real-valued functions of a real 
variable. 

Theorem 4.19: Taylor's Remainder Theorem 

Let / be a real- valued function on an interval (c — r,c+ r) , and assume that / s C n ((c — r, c + r)) , 
and that /(") is differentiable on (c — r,c + r) . Then, for each x in (c — r,c+ r) there exists a y 
between c and x such that 

/ (*) - (T ( ™. c) ) (*) = ^^jf(* " cf +1 - (4.7) (4.117) 

REMARK If we write / (x) = T? c (x) + R n + 1 (x) , where R n +\ (x) is the error or remainder term, 
then this theorem gives a formula, and hence an estimate, for that remainder term. This is the 
evident connection with Numerical Analysis. 
Proof: 

We prove this theorem by induction on n. For n = 0, this is precisely the Mean Value Theorem. 
Thus, 

/ (a;) - T° c (x) = f(x)-f (c) = /' (y) (x - c.(4.118) 

Now, assuming the theorem is true for all functions in C n_1 ((c — r, c + r)) , let us show it is true for 

the given function / s C n ((c — r,c + r)) . Set g (x) = f (x)— (XT 1 , s j (x) and let h (x) = (x — c) n 

Observe that both g (c) = and h (c) = 0. Also, if x ^ c, then h (x) / 0. So, by the Cauchy Mean 
Value Theorem, we have that 



9Jx) = g 0*0 - g (c) = g (w) 
h (x) h (x) — h (c) ti (w) 
for some w between c and x. Now 



(4.119) 



g (w) = f («,) - (tft >c) ) ' («,) = /' («,) - (T^.7^) ( W ) (4.120) 

(See the preceding exercise.), and h' (w) = (n + 1) (w — c) n . Therefore, 



f( X )-(T™ c) )(x) g^ 

(x-c) n + 1 h(x) 



hTiw) (4.121) 



f{w) -{ T lf,)) M 

(n J rl)(w — c) n 

We apply the inductive hypotheses to the function /' (which is in C n ~ x ((c — r,c-\- r)) and obtain 



f( X )-(T lU )( x) /'(")- fe'.c))^) 

(a;-c)'»+ 1 __ (n+l)(iu-c)" 



/ ri| («) ( w - c )" 
(n+l)(u;-c)" (4.122) 



f M (v) 

(n+1)! 

/ ( " +1) fa) 

(n+1)! 



for some y between c and w. But this implies that 

/(n+D (j,) (a; - C )™+ 1 



/(-) - ( r (/,c)J (-) = g^ ^ ■ (4-123) 
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for some y between c and x, which finishes the proof of the theorem. 

Exercise 4.30 

Define / (x) = for x < and / (a;) = e~ x l x for x > 0. Verify that / e C°° (R) , that f^ (0) = 
for all n, and yet / is not expandable in a Taylor series around 0. Interpret Taylor's Remainder 
Theorem for this function. That is, describe the remainder R n +\ (x) . 

As a first application of Taylor's Remainder Theorem we give the following result, which should be familiar 
from calculus. It is the generalized version of what's ordinarily called the "second derivative test." 

Theorem 4.20: Test for Local Maxima and Minima 

Let / be a real- valued function in C n (c — r,c + r) , suppose that the n + 1st derivative f( n+1 > of 
/ exists everywhere on (c — r,c+ r) and is continuous at c, and suppose that f( k > (c) = for all 
1 < k < n and that /("+ 1 ) ( c ) ^ 0. Then: 

1. If n is even, / attains neither a local maximum nor a local minimum at c. In this case, c is 
called an inflection point. 

2. If n is odd and /( n+1 ) (c) < 0, then / attains a local maximum at c. 

3. If n is odd and /( n+1 ) (c) > 0, then / attains a local minimum at c. 

Proof: 

Since /(™ +1 ) is continuous at c, there exists a 6 > such that /( n+1 ) (y) has the same sign as 
yO+i) ( c ) f or a ii y g ( c _ ^ c _|_ £j _ w e nave by Taylor's Theorem that if x € (c — S, c + S) then 
there exists a y between x and c such that 

/ \ f( n + 1 ) (ii) 

f (x) = (r ( «, c) ) (x) + l ¥T ^(x - c) n+ \ (4.124) 



from which it follows that 



f(x)-f(c) = n =1 f (k Hc)ki(x-c) k +i^i(x- c r +i 



f in+1 Hy) ( T r \n+i 

(n+l)! ^ I 



(4.125) 



Suppose n is even. It follows then that if x < c, the sign of (x — c) is negative, so that 
the sign of / (x) — f (c) is the opposite of the sign of /("+ 1 ) (c) . On the other hand, if x > c, then 
(x — c) > 0, so that the sign of / (x) — f (c) is the same as the sign of /( n+1 ) (c) . So, / (x) > / (c) 
for all nearby x on one side of c, while / (x) < f (c) for all nearby x on the other side of c. Therefore, 
/ attains neither a local maximum nor a local minimum at c. This proves part (1). 

Now, if n is odd, the sign of / (x) — f (c) is the same as the sign of /( ra+1 ) (y) , which is the same 
as the sign of /(™ +1 ) (c) , for all x € (c - S, c + 5) . Hence, if f ( - n+1 '> (c) < 0, then / (x) - f (c) < 
for all x € (c — 5, c + 5) , showing that / attains a local maximum at c. And, if /( n+1 ) (c) > 0, then 
the sign of / (x) — f (c) is positive for all x € (c— 5,c+ 5) , showing that / attains a local minimum 
at c. This proves parts (2) and (3). 



4.10 The General Binomial Theorem 10 

We use Taylor's Remainder Theorem to derive a generalization of the Binomial Theorem to nonintegral 
exponents. First we must generalize the definition of binomial coefficient. 



°This content is available online at <http://cnx.Org/content/m36205/l.2/>. 
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Definition 4.9: 

Let a be a complex number, and let k be a nonnegative integer. We define the general binomial 
coefficient (^) by 

a\ a (a — 1) ... (a — k + 1) 

1 ; . (4.126) 



.kJ k\ 

If a is itself a positive integer and k < a, then ( ? ) agrees with the earlier definition of the binomial 
coefficient, and (?) = when k > a. However, if a is not an integer, but just an arbitrary complex 
number, then every (?) ^ 0. 

Exercise 4.31 

Estimates for the size of binomial coefficients. Let a be a fixed complex number. 

a. Show that 

iQi^n( 1+ y) (4 - I27) 

for all nonnegative integers k. HINT: Note that 

/q\ < |a| (|nZp/tn| + 1) (\alpha\ + 2) ... (\a\ + fc - 1) 

b. Use part (a) to prove that there exists a constant C such that 

, . < C2 k (4.129) 

k I 

for all nonnegative integers k. HINT: Note that (1 + |a|/j) < 2 for all j > \a\. 

c. Show in fact that for each e > there exists a constant C e such that 

\Q\<C e (l + e) k (4.130) 

for all nonnegative integers k. 

d. Let h {t) be the power series function given by h (t) = J2T=o ( fe) * ■ ^ se ^ ne ra tio test to show 
that the radius of convergence for h equals 1. 

4.14: 
REMARK The general Binomial Theorem, if there is one, should be something like the following: 

(*+yr = EQ^~v. ( 4 - 131 ) 

fe=0 

The problem is to determine when this infinite series converges, i.e., for what values of the three 
variables x, y, and a does it converge. It certainly is correct if x = 0, so we may as well assume 
that x ^ 0, in which case we are considering the validity of the formula 

oo 

(a; + y) a = x a (l + tf = X a J2{T) ^ ( 4A32 ) 

fe=0 

where t = y/x. Therefore, it will suffice to determine for what values of t and a does the infinite 
series 

oo 

ECK ( 4 -133) 

k=0 
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equal 

(l + i) Q . (4.134) 

The answer is that, for n arbitrary complex number a, this series converges to the correct value for 
all t € (— 1, 1) . (Of course, t must be larger than —1 for the expression (1 + t) a even to be defined.) 
However, the next theorem only establishes this equality for t's in the subinterval ( — 1/2,1/2). 
As mentioned earlier, its proof is based on Taylor's Remainder Theorem. We must postpone the 
complete proof to Section 5.1, where we will have a better version of Taylor's Theorem. 

Theorem 4.21: 

Let a = a + bi be a fixed complex number. Then 



fc=0 



for all te (-1/2,1/2). 
Proof: 

Of course, this theorem is true if a is a nonnegative integer, for it is then just the original Binomial 
Theorem, and in fact in that case it holds for every complex number t. For a general complex 
number a, we have only defined x a for positive x's, so that (1 + t) a is not even defined for t < — 1. 
Now, for a general a = a + bi, consider the function g : (—1/2,1/2) — > C defined by g(t) = 
(1 + t) a . Observe that the nth derivative of g is given by 

(„\ , „ a (a — 1) ... (a — n + 1) 

g (n) (t) = — ^—!-. 4.136 

y W (l + i )"- Q V ' 

Then g s C°° ((—1/2, 1/2)) . (Of course, g is actually in C°° (—1, 1) , but the present theorem is 
only concerned with t's in (—1/2, 1/2) .) 
For each nonnegative integer k define 

a k = 9^ (0) /*! = °(«-D->-*+l) = Q , (4.137) 

and set h equal to the power series function given by h (t) = J^'kLo a kt k - According to part (d) of 
the preceding exercise, the radius of convergence for the power series^ a,kt k is 1. The aim of this 
theorem is to show that g (t) = h (i) for all —1/2 < t < 1/2. In other words, we wish to show that 
g agrees with this power series function at least on the interval (—1/2, 1/2) . It will suffice to show 
that the sequence {S n } of partial sums of the power series function h converges to the function g, 
at least on (—1/2, 1/2) . We note also that the nth partial sum of this power series is just the nth 
Taylor polynomial T" for g. 

fe=0 ' fe=0 

Now, fix a t strictly between —1/2 and 1/2, and let r < 1 be as in part (c) of Exercise 4.31. That 
is, \tj (1 + y) | < r for every y between and t. (This is an important inequality for our proof, and 
this is one place where the hypothesis that t € ( — 1/2,1/2) is necessary.) Note also that, for any 
y e ( — 1/2, 1/2) , we have | (1 + y) a \ = (1 + y) a , and this is trapped between (l/2) a and (3/2)°. 
Hence, there exists a number M such that | (1 + y) a \ < M for all y s ( — 1/2, 1/2) . 

Next, choose an e > for which (3 = (1 + e)r < 1. We let C e be a constant satisfying the 
inequality in Part (c) of Exercise 4.31. So, using Taylor's Remainder Theorem, we have that there 



Ill 



exists a y between and t for which 



lff(*)-ELo«fc**l = \9(t)-(T^ 0) (t)\ 



'"'■ +1) (») t n+l| 



I (n+1)! 
| g(a-l)...(g-n) f rt+l| 
l(™+l)!(l+a)" +1 —' 



((^OiKi+ynii^r 1 (4-139) 



< 
< 

< C e {l + e) n+1 Mr n+1 

< C E M(3 n+ \. 



C £ (l + e) n+1 M\^- y \ n+1 



Taking the limit as n tends to oo, and recalling that /3 < 1, shows that g(t) = h(t) for all 
— 1/2 < £ < 1/2, which completes the proof. 



4.11 More on Partial Derivatives 11 

We close the chapter with a little more concerning partial derivatives. Thus far, we have discussed functions 
of a single variable, either real or complex. However, it is difficult not to think of a function of one complex 
variable z = x + iy as equally well being a function of the two real variables x and y. We will write (a, b) 
and a + hi to mean the same point in C = R 2 , and we will write | (a, b) | and \a + bi\ to indicate the same 
quantity, i.e., the absolute value of the complex number a + bi = (a, b) . We have seen in Theorem 4.4, 
p. 86 that the only real-valued, different iable functions of a complex variable are the constant functions. 
However, this is far from the case if we consider real- valued functions of two real variables, as is indicated in 
Exercise 4.8. Consequently, we make the following definition of differentiability of a real-valued function of 
two real variables. Note that it is clearly different from the definition of differentiability of a function of a 
single complex variable, and though the various notations for these two kinds of differentiability are clearly 
ambiguous, we will leave it to the context to indicate which kind we are using. 

Definition 4.10: 

Let / : S — » R be a function whose domain is a subset S of R 2 , and let c = (a, b) be a point in 
the interior 5° of S. We say that / is differentiable, as a function of two real variables, at the point 
(a, b) if there exists a pair of real numbers L\ and Li and a function such that 

/ (o + h u b + hi) - f (a, b) = L x h x + L 2 h 2 + 6 {h u h 2 ) (4.140) 

and 

0(hi,hi) 

lim i A t = °- 4 - 141 

One should compare this definition with part (3) of Theorem 4.2, p. 84. 

Each partial derivative of a function / is again a real- valued function of two real variables, and 
so it can have partial derivatives of its own. We use simplifying notation like f xyxx and f yy yxyy... 
to indicate "higher order" mixed partial derivatives. For instance, f xxyx denotes the fourth partial 
derivative of /, first with respect to x, second with respect to x again, third with respect to y, 
and finally fourth with respect to x. These higher order partial derivatives are called mixed partial 
derivatives. 



lr This content is available online at <http://cnx.Org/content/m36206/l.2/>. 
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Definition 4.11: 

Suppose S is a subset of R 2 , and that / is a continuous real- valued function on S. If both partial 
derivatives of / exist at each point of the interior S° of S, and both are continuous on S°, then / 
is said to belong to C 1 (S) . If all kth order mixed partial derivatives exist at each point of 5°, and 
all of them are continuous on S°, then / is said to belong to C k (S) . Finally, if all mixed partial 
derivatives, of arbitrary orders, exist and are continuous on S°, then / is said to belong to C°° (S) . 

Exercise 4.32 

a. Suppose / is a real-valued function of two real variables and that it is differentiable, as a 
function of two real variables, at the point (a, b) . Show that the numbers L\ and L 2 in the 
definition are exactly the partial derivatives of / at (a, b) . That is, 

t tialf f(a + h,b)-f(a,b) uta^ 

L\ = (a, b) = hm (4.142) 

tialx h^o h 

and 

t Half, ,s ,. f(a,b + h)-f(a,b) ( aia?\ 

Li = (a,b)=hm . (4.143 

tialy h^a h 

b. Define / on R 2 as follows: / (0, 0) = 0, and if (x, y) =/= (0, 0) , then / (x, y) = xyj [x 2 + y 2 ) . 
Show that both partial derivatives of / at (0,0) exist and are 0. Show also that / is not, as 
a function of two real variables, differentiable at (0, 0) . HINT: Let h and k run through the 
numbers \/n. 

c. What do parts (a) and (b) tell about the relationship between a function of two real variables 
being differentiable at a point (a, b) and its having both partial derivatives exist at (a, b)l 

d. Suppose / = u + iv is a complex-valued function of a complex variable, and assume that / is 
differentiable, as a function of a complex variable, at a point c = a + bi = (a,b) . Prove that 
the real and imaginary parts u and v of / are differentiable, as functions of two real variables. 
Relate the five quantities 

tialu tialu tialv tialv , ,,,,,\ 

—— (a, b) , — — (a, 6) , — — (a, b) , —— (a, b) , and / (c) . (4.144) 

tialx tialy tialx tialy 

Perhaps the most interesting theorem about partial derivatives is the "mixed partials are equal" theorem. 
That is, f xy = f yx . The point is that this is not always the case. An extra hypothesis is necessary. 

Theorem 4.22: Theorem on mixed partials 

Let / : S — » R be such that both second order partials derivatives f xy and f yx exist at a point 
(a, b) of the interior of S, and assume in addition that one of these second order partials exists 
at every point in a disk B r (a,b) around (a,b) and that it is continuous at the point (a, b) . Then 
fxy (a, b) = fy X (a, b) . 
Proof: 

Suppose that it is f yx that is continuous at (o, b) . Let e > be given, and let <5i > be such that 
if | (c, d) — (a, b) | < Si then \f yx (c, d) — f yx (a, b)\ < e. Next, choose a d> 2 such that if < |fc| < S 2: 
then 

,, ( M fx(a,b + k)-f x (a,b) 

\f xy {a,b) 1 < £, (4.145) 

and fix such a k. We may also assume that |fc| < Si/2. Finally, choose a ^3 > such that if 
< \h\ < S 3 , then 

, . / * f (a + h,b + k) — f (a,b + k) , 

\f x (a,b+k)-^ ! ' J{ ' >-\<\k\e, (4.146) 



and 
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\fx{a,b) - 1 < \k\e, (4.147) 



and fix such an h. Again, we may also assume that \h\ < S\/2. 

In the following calculation we will use the Mean Value Theorem twice. 

< \fxy{a,b) -f yx {a,b)\ 

< \f xy (a,b)- Ma ' b+ T Ma ' b) \ 

+ \ Ma ' b+ T Ma ' b) -fvAa,b)\ 

< £ , | /,(a.6+fc) - /(o + h - t+ ^- /( °- t+fc) | 



fe 
h - -fx(a,b) 



(4.148) 



1 I k I 

+ I f(a+hA+k)-f(a,b+k) + (f(a+h,b)-f(a,b)) _ , / jx , 

< 3c ! | /(a+fc,b+fc)-/(a,b+fc) + (/(a+ft,b)-/(a,fr)) I (a 6) I 

3e+ | /»(°+M , )-/.(°.* , ) _ /ya(a>6) | 

= 3e+ 1/^ (a', 6') - /^ (a, b) \ 

< 4e, 

because b' is between 6 and b + k, and o' is between a and + /1, so that | (a , 6) — (a, 6) | < 5i/y/2 < 
6\. Hence, l/^j, (a, b) — f yx (a, b) < 4e, for an arbitrary e, and so the theorem is proved. 

Exercise 4.33 

Let / be defined on R 2 by / (0, 0) = and, for (x, y) ^ (0, 0) ,/ (x, y) = x 3 y/ (x 2 + y 2 ) . 

a. Prove that both partial derivatives f x and f y exist at each point in the plane. 

b. Show that f yx (0, 0) = 1 and f xy (0, 0) = 0. 

c. Show that f xy exists at each point in the plane, but that it is not continuous at (0, 0) . 

The following exercise is an obvious generalization of the First Derivative Test for Extreme Values, Theo- 
rem 4.8, First Derivative Test for Extreme Values, p. 92, to real-valued functions of two real variables. 

Exercise 4.34 

Let / : S — > R be a real-valued function of two real variables, and let c = (a, b) e S° be a point at 
which / attains a local maximum or a local minimum. Show that if either of the partial derivatives 
tialf/tialx or tialf/tialy exists at c, then it must be equal to 0. 

HINT: Just consider real- valued functions of a real variable like x — > / (x, b) or y — » f (a,y) , 
and use . 

Whenever we make a new definition about functions, the question arises of how the definition fits with 
algebraic combinations of functions and how it fits with the operation of composition. In that light, the next 
theorem is an expected one. 

Theorem 4.23: 

(Chain Rule again) Suppose S is a subset of R 2 , that (a, b) is a point in the interior of S, and 
that / : S — > R is a real-valued function that is different iable, as a function of two real variables, 
at the point (a, b) . Suppose that T is a subset of R, that c belongs to the interior of T, and that 
<f> : T — > R 2 is different iable at the point c and <j> (c) = (a, b) . Write <f> (t) = (x (t) , y (tj) . Then the 
composition / o <f> is differentiable at c and 
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Proof: 

From the definition of differentiability of a real- valued function of two real variables, write 

f{a+h 1 ,b+h 2 )-f (a, b) = L x hx + L 2 h 2 + f (H 1} ft 2 ) . (4.150) 

and from part (3) of Theorem 4.2, p. 84, write 

4> (c + ft) - 4> (c) = 4 (c) h + Oj, (ft) , (4.151) 

or, in component form, 

x (c + ft) - x (c) = x (c + ft) - a = x (c) ft + 6 X (ft) (4.152) 

and 

y (c + ft) - y (c) = y (c + ft) - b = y (c) ft + 9 y (ft) . (4.153) 

We also have that 

Urn e -ij^f = 0, (4-154) 

|(fci,/i2)|-o |(fti,ft 2 )| 

Ox (ft) 
lim^-^ = 0, 4.155 

h^o ft 

and 

6 V (ft) 
U m J*±J- = o. 4.156 

h^O ft 

We will show that fo<j> is differentiable at c by showing that there exists a number L and a function 
9 satisfying the two conditions of part (3) of Theorem 4.2, p. 84. 
Define 

fci (ft) , fc 2 (ft) = <Hc + ft) - <£ (c) = (x (c + ft) - x (c) , y (c + ft) - y (c)) . (4.157) 

Thus, we have that 

/o^(c+/O-/o0(c) = f(<l>(c + h))-f(<j>(c)) 

f(x(c+h),y(c+h))-f(x(c),y(c)) 

f(a + ki(h),b + k2(h))-f(a,b) 

L x hx (ft) + L 2 k 2 (ft) + f {k x (ft) , k 2 (ft)) 

h(x(c+h)-x (c)) + L 2 (y (c + ft) - y (c)) 

+ 0f{k 1 {h),k 2 {h)) 
Lx (a:' (c) ft + X (ft)) + L 2 (y (c) ft + 0„ (ft)) 

+ 9 f (k 1 (h),k 2 (h)) 

= (Lix (c) + L 2 y (c)) ft 

+ Lx9 x (ft) + L 2 y (ft) + Of (fci (ft) , fc 2 (ft)) . 

We define L = (Lix' (c) + L 2 y (c)) and (9 (ft) = l x 9 x (ft) + L 2 ^ (ft) + Of (fci (ft) , fc 2 (ft)) . By these 
definitions and the calculation above we have Equation (4.1) 

fo(/>(c + h)- fo<j>(c) = Lh + 6(h), (4.159) 



(4.158) 
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so that it only remains to verify (4.14) for the function 9. We have seen above that the first two 
parts of satisfy the desired limit condition, so that it is just the third part of 8 that requires 
some proof. The required argument is analogous to the last part of the proof of the Chain Rule 
(Theorem 4.7, Chain Rule, p. 89), and we leave it as an exercise. 

Exercise 4.35 

a. Finish the proof to the preceding theorem by showing that 

9 f (ki(h),k 2 (h)) 

Urn ' K y '' K ' ' = 0. 4.160 

h^O h 

HINT: Review the corresponding part of the proof to Theorem 4.7, Chain Rule, p. 89. 

b. Suppose / : S — > 7? is as in the preceding theorem and that cf> is a real-valued function of 
a real variable. Suppose / is differentiable, as a function of two real variables, at the point 
(a, b) , and that <fi is differentiable at the point c = f (a,b) . Let g = <j>o f. Find a formula for 
the partial derivatives of the real- valued function g of two real variables. 

c. (A generalized Mean Value Theorem) Suppose u is a real- valued function of two real variables, 
both of whose partial derivatives exist at each point in a disk B r (a, b) . Show that, for any 



two points (x,y) and (x ,y) in B r {a,b), there exists a point i,!/ on the line segment 
joining (x,y) to (x',y') such that 

. / > >\ tialu f" ~\ , ,n tialu f" ~\ , . N . , „„„. 

u{x,y)-u(x,y) = ^—^ \x,yj(x-x) + j-j- lx,yj(y-y). (4.161) 

HINT: Let </> : [0,1] -> R 2 be defined by <j>(t) = (l-t)(x',y) +t(x,y). Now use the 
preceding theorem, 
d. Verify that the assignment / — * tialf/tialx is linear; i.e., that 

tied (/ + g) = tialf tialg 
tialx tialx tialx 

Check that the same is true for partial derivatives with respect to y. 
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Chapter 5 

Integration, Average Behavior 



5.1 Integration, Average Behavior A=ir r~2 x 

In this chapter we will derive the formula A = nr 2 for the area of a circle of radius r. As a matter of fact, 
we will first have to settle on exactly what is the definition of the area of a region in the plane, and more 
subtle than that, we must decide what kinds of regions in the plane "have" areas. Before we consider the 
problem of area, we will develop the notion of the integral (or average value) of a function defined on an 
interval [a, b] , which notion we will use later to compute areas, once they have been defined. 
The main results of this chapter include: 

1. The definition of integrability of a function, and the definition of the integral of an integrable 
function, 

2. The Fundamental Theorem of Calculus (Theorem 5.9, Fundamental Theorem of Calculus, p. 131), 

3. The Integral Form of Taylor's Remainder Theorem (Theorem 5.12, Integral Form of Taylor's 
Remainder Theorem, p. 134), 

4. The General Binomial Theorem (Theorem 5.13, General Binomial Theorem, p. 134), 

5. The definition of the area of a geometric set, 

6. A = 7rr 2 (Theorem 5.15, p. 139), and 

7. The Integral Test (Theorem 5.17, p. 141). 



5.2 Integrals of Step Functions 2 

We begin by defining the integral of certain (but not all) bounded, real-valued functions whose domains 
are closed bounded intervals. Later, we will extend the definition of integral to certain kinds of unbounded 
complex-valued functions whose domains are still intervals, but which need not be either closed or bounded. 
First, we recall from Section 3.1 the following definitions. 

Definition 5.1: 

Let [a, b] be a closed bounded interval of real numbers. By a partition of [a, b] we mean a finite set 
P = { x (\ < x i < ••• < x n } of n + 1 points, where xq = a and x n = b. 

The n intervals {[a;i_i,a;i]} are called the closed subintervals of the partition P, and the n 
intervals {(xi-i,Xi)} are called the open subintervals or elements of P. 

We write || P || for the maximum of the numbers (lengths of the subintervals) {xi — Xi-i}, and 
call || P || the mesh size of the partition P. 



1 This content is available online at <http://cnx.Org/content/m36207/l.2/>. 
2 This content is available online at <http://cnx.Org/content/m36208/l.2/>. 
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If a partition P = {xi} is contained in another partition Q = {yj}, i.e., each X{ equals some yj, 
then we say that Q is finer than P. 

Let / be a function on an interval [a, b] , and let P = {x < ... < x n } be a partition of [a,b] . 
Physicists often consider sums of the form 

n 

S P;{Vi} =^2f(yi)( X i- X i-l)' ( 5 -!) 

i=\ 

where y, is a point in the subinterval {xi-\,xi) . These sums (called Riemann sums) are approxi- 
mations of physical quantities, and the limit of these sums, as the mesh of the partition becomes 
smaller and smaller, should represent a precise value of the physical quantity. What precisely is 
meant by the " limit" of such sums is already a subtle question, but even having decided on what 
that definition should be, it is as important and difficult to determine whether or not such a limit 
exists for many (or even any) functions /. We approach this question from a slightly different point 
of view, but we will revisit Riemann sums in the end. 

Again we recall from Section 3.1 the following. 

Definition 5.2: 

Let [a, b] be a closed bounded interval in R. A real- valued function h : [a,b] — > R is called a step 
function if there exists a partition P = {xo < x\ < ... < x n } of [a, b] such that for each 1 < i < n 
there exists a number Oj such that h (x) = cii for all x € (xi—i, xi) ■ 

5.1: 

REMARK A step function h is constant on the open subintervals (or elements) of a certain 
partition. Of course, the partition is not unique. Indeed, if P is such a partition, we may add 
more points to it, making a larger partition having more subintervals, and the function h will still 
be constant on these new open subintervals. That is, a given step function can be described using 
various distinct partitions. 

Also, the values of a step function at the partition points themselves is irrelevant. We only 
require that it be constant on the open subintervals. 

Exercise 5.1 

Let h be a step function on [a, b] , and let P = {xo < x\ < ... < x n } be a partition of [a, b] such 
that h(x) = a,i on the subinterval (x,i-\,xi) determined by P. 

a. Prove that the range of h is a finite set. What is an upper bound on the cardinality of this 
range? 

b. Prove that h is differentiable at all but a finite number of points in [a, b] . What is the value 
of ti at such a point? 

c. Let / be a function on [a, b] . Prove that / is a step function if and only if /' (x) exists and 
= for every x G (a, b) except possibly for a finite number of points. 

d. What can be said about the values of h at the endpoints {xi\ of the subintervals of PI 

e. (e) Let h be a step function on [a, b] , and let j be a function on [a, b] for which h (x) = j (x) 
for all x g [a, b] except for one point c. Show that j is also a step function. 

f. If A; is a function on [a, b] that agrees with a step function h except at a finite number of 
points ci, C2, ..., cat, show that k is also a step function. 

Exercise 5.2 

Let [a, b] be a fixed closed bounded interval in R, and let H ([a, b}) denote the set of all step 
functions on [a, b] . 

a. Using Part (c) of Exercise 5.1, prove that the set H ([a, b}) is a vector space of functions; i.e., 
it is closed under addition and scalar multiplication. 
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b. Show that H ([a, b]) is closed under multiplication; i.e., if hi,fi2 € H ([a,b]) , then hih,2 € 
H([a,b]). 

c. Show that H ([a,b]) is closed under taking maximum and minimum and that it contains all 
the real-valued constant functions. 

d. We call a function \ an indicator function if it equals 1 on an interval (c, d) and is outside 
[c, d] . To be precise, we will denote this indicator function by X(c,d)- Prove that every indicator 
function is a step function, and show also that every step function h is a linear combination 
of indicator functions: 

n 

h = ^2 a jX( Cj ,d J )- ( 5 - 2 ) 

i=i 

e. Define a function k on [0, 1] by setting k (x) = if x is a rational number and k (x) = 1 if 
x is an irrational number. Prove that the range of k is a finite set, but that k is not a step 
function. 

Our first theorem in this chapter is a fundamental consistency result about the "area under the graph" of 
a step function. Of course, the graph of a step function looks like a collection of horizontal line segments, 
and the region under this graph is just a collection of rectangles. Actually, in this remark, we are implicitly 
thinking that the values {aj} of the step function are positive. If some of these values are negative, then we 
must re-think what we mean by the area under the graph. We first introduce the following bit of notation. 

Definition 5.3: 

Let libea step function on the closed interval [a, b] . Suppose P = {xo < x\ < ... < x n } is a 
partition of [a, b] such that h(x) = a,i on the interval (xi_i,Xi) . Define the weighted average of 
hrelative toP to be the number Sp (h) defined by 

n 

Sp (h) = y^ aj (xj - gj-i) ■ (5.3) 

i=i 

5.2: 

REMARK Notice the similarity between the formula for a weighted average and the formula for 
a Riemann sum. Note also that if the interval is a single point, i.e., a = b, then the only partition 
P of the interval consists of the single point xo = a, and every weighted average Sp (h) = 0. 

The next theorem is not a surprise, although its proof takes some careful thinking. It is simply the 
assertion that the weighted averages are independent of the choice of partition. 

Theorem 5.1: 

Let h be a step function on the closed interval [a, b] . Suppose P = {xq < x\ < ... < x n } is a partition 
of [a,b] such that h{x) = a,i on the interval (xi-i,Xi) , and suppose Q = {j/o < Hi < ••• < Vm} is 
another partition of [a, b] such that h (x) = bj on the interval (yj-i, Vj) ■ Then the weighted average 
of h relative to P is the same as the weighted average of h relative to Q. That is, Sp (h) = Sq (h) . 
Proof: 

Suppose first that the partition Q is obtained from the partition P by adding one additional point. 
Then m = n + 1, and there exists an Iq between 1 and n — 1 such that 

1. for < i < «o we have yi = X{. 

2. x io < y io+ i < x io+1 . 

3. For iq < i < n we have Xi = yi+i- 

In other words, j/j +i is the only point of Q that is not a point of P, and yi +\ lies strictly between 
x ia and x io+ i. 

Because h is constant on the interval (xi ,Xi +i) = {yi ,yi +2) , it follows that 
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1. For I < i < io,a>i = bi. 

2. h„+i = ^o+2 = a*o+i- 

3. For i + 1 < i < n,di = 6,+i. 

So, 

Sp(h) = E"=i a » (xi - Xi-i) 

= J2?=l a i (%i - x i-l) + ai +i (x io+ i - x io ) 

= ElLi h {yi - yi-i) + a io+ i (y io+2 - Vi ) 

+ T,7=i +2bi+i(yi+i-yi) 
= T,T=ibi{yi-yi-i) + ai 0+ i{yi 0+ 2-yi 0+ i + y io+ i-yi ) (5.4) 

+ ES +3 bi (yi - yi-i) 
= ElLi ^ (y» - Vt-i) + b io+i (Wo+i - Wo) + 6 *o+2 (j/io+2 - yio+i) 

+ HZi +3 b i(yi-yi-i) 

= Y,7U b i(y*-yi-i-) 

s Q (h) , 

which proves the theorem in this special case where Q is obtained from P by adding just one more 
point. 

It follows easily now by induction that if Q is obtained from P by adding any finite number 
of additional points, then h is constant on each of the open subintervals determined by Q, and 
S Q (h) = S P (h). 

Finally, let Q = {yo < y\ < ... < y m } be an arbitrary partition of [a, b] , for which h is constant 
on each of the open subintervals (yj_ 1 ,yj) determined by Q. Define R to be the partition of [a, b] 
obtained by taking the union of the partition points {x^ and {yj}- Then R is a partition of [a, b] that 
is obtained by adding a finite number of points to the partition P, whence Sr (h) = Sp (h) . Likewise, 
R is obtained from the partition Q by adding a finite number of points, whence Sp (h) = Sq (h) , 
and this proves that Sq (h) = Sp (h) , as desired. 

Definition 5.4: 

Let [a, b] be a fixed closed bounded interval in R. We define the integral of a step function h on 
[a, b] , and denote it by J h, as follows: If P = {xq < X\ < ... < x n } is a partition of [a, b] , for which 
h (x) = ai for all x G (xi-i,Xi) , then 

/n 
h = S P (h) = y^ a, (xj - Xj-\) . (5.5) 

i=l 

5.3: 

REMARK The integral of a step function h is defined to be the weighted average of h relative 
to a partition P of [a, b] . Notice that the preceding theorem is crucial in order that this definition 
of J h be unambiguously defined. The integral of a step function should not depend on which 
partition is used. Theorem 5.1, p. 119 asserts precisely this fact. 

Note also that if the interval is a single point, i.e., a = b, then the integral of every step function 
h is 0. 
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We use a variety of notations for the integral of h : 

h= h = h(t) dt. (5.6) 

J a J a 

The following exercise provides a very useful way of describing the integral of a step function. Not 
only does it show that the integral of a step function looks like a Riemann sum, but it provides a 
description of the integral that makes certain calculations easier. See, for example, the proof of the 
next theorem. 

Exercise 5.3 

Suppose h is a step function on [a, b] and that R = {zo < z\ < ... < z n } is a partition of [a, b] for 
which h is constant on each subinterval (zi-i,Zi) of R. 



a. Prove that 



/n 
h = S R (h) = y y j h (wj) {zj - Zj-i) , (5.7) 



where, for each 1 < i < n,w, is any point in (zj_i, z{) . (Note then that the integral of a step 
function takes the form of a Riemann sum.) 
b. Show that J h is independent of the values of h at the points {zi} of the partition R. 

Exercise 5.4 

Let hi and hi be two step functions on [a, b] . 

a. Suppose that h\ (x) = hi (x) for all x € [a, b] except for one point a Prove that J hi = J hi. 
HINT: Let P be a partition of [a, b] , for which both hi and hi are constant on its open 
subintervals, and for which c is one of the points of P. Now use the preceding exercise to 
calculate the two integrals. 

b. Suppose hi (x) = hi (x) for all but a finite number of points ci, ...,cn € [a, b] . Prove that 
Jhi = Jhi. 

We have used the terminology "weighted average" of a step function relative to a partition P. The next 
exercise shows how the integral of a step function can be related to an actual average value of the function. 

Exercise 5.5 

Let ft be a step function on the closed interval [a,b] , and let P = {xq < x\ < ... < x n } be a 
partition of [a, b] for which h(x) = a; on the interval (xi_i,Xj) . Let us think of the interval [a, b] 
as an interval of time, and suppose that the function h assumes the value a% for the interval of 
time between Xi-i and Xj. Show that the average value A (h) taken on by h throughout the entire 
interval ([a, 6]) of time is given by 

A{h) = l^. (5.8) 

b — a 

Theorem 5.2: 

Let H ([a, b\) denote the vector space of all step functions on the closed interval [a,b] . Then the 
assignment h — * J h of H ([a, b]) into R has the following properties: 

1. (Linearity) H ([a, b\) is a vector space. Furthermore, J (hi + hi) = J hi + J hi, and J ch = 
c f h for all hi, hi, h € H ([a, b}) , and for all real numbers c. 

2. If h = YH=i a iX(ci ,di) is a linear combination of indicator functions (See part (d) of Exer- 
cise 5.2), then J h = Y17=i a i (^» — c «) • 

3. (Positivity) If h (x) > for all x £ [a, b] , then Jh>0. 
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4. (Order-preserving) If hi and hi are step functions for which h\ (x) < hi (x) for all x G [a, b] , 
then J hi < J hi. 

Proof: 

That H ([a, b}) is a vector space was proved in part (a) of Exercise 5.2. Suppose P = {xq < 
x\ < ... < x n } is a partition of [a, b] such that hi (x) is constant for all x € (a;»-i, Xi) , and suppose 
Q = {no < Vi < ■■■ < Vm} is a partition of [a, b] such that hi (x) is constant for all x € (Vj-i, Vj) ■ Let 
R = {zo < z\ < ... < z r } be the partition of [a, b] obtained by taking the union of the Xj's and the 
j/j's. Then h\ and /12 are both constant on each open subinterval of R, since each such subinterval 
is contained in some open subinterval of P and also is contained in some open subinterval of Q. 
Therefore, hi + hi is constant on each open subinterval of R. Now, using Exercise 5.3, we have that 

J(hi+h 2 ) = Efc=i(( /l i + h 2){w k )){zk ~ Zfc-i) 

= J2l=i h i ( w k) (zk - Zk-i) + J2i=i hl 2 (wk) {zk - z k -i) (5.9) 

Jhi+Jh 2 . 

This proves the first assertion of part (1). 

Next, let P = {xo < xi < ... < x n } be a partition of [a,b] such that h (x) is constant on each 
open subinterval of P. Then ch (x) is constant on each open subinterval of P, and using Exercise 5.3 
again, we have that 

/ i ch ) = ^ =1 ch(wi)(xi-Xi-i) 

= cY^ih(wi)(xi-Xi-i) (5-10) 

cjh, 

which completes the proof of the other half of part (1). 

To see part (2), we need only verify that /x(ci,d 4 ) = di — Ci, for then part (2) will follow from 
part (1). But X(ci,di) ls J us t a step function determined by the four point partition {a, Ci, di, b} and 
values on (a,Ci) and (di,b) and 1 on (ci,di) . Therefore, we have that J Xfc^dA = di — Ci- 

If h (x) > for all x, and P = {xo < xi < ... < x n } is as above, then 

/n 
h = Y J h{w l ){x l -x l -i)>0, (5.11) 

i=l 

and this proves part (3). 

Finally, suppose hi (x) < hi (x) for all x € [a, b] . By Exercise 5.2, we know that the function 
ha = hi — h\ is a step function on [a, b] . Also, /13 (x) > for all x G [a, b] . So, by part (3), / h$ > 0. 
Then, by part (1), 

0< J h 3 = J \hi-hi) = J hi- J hi, (5.12) 

which implies that J hi < J hi, as desired. 

Exercise 5.6 

a. Let h be the constant function c on [a, b] . Show that J h = c(b — a) . 

b. Let a<c<d<bbe real numbers, and let h be the step function on [a, b] that equals r for 
c < x < d and otherwise. Prove that J h(i) dt = r (d — c) . 

c. Let h be a step function on [a, b] . Prove that \h\ is a step function, and that \ J h\ < J \h\. 
HINT: Note that —\h\ (x) < h (x) < \h\ (x) . Now use the preceding theorem. 
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d. Suppose ft is a step function on [0,6] and that c is a constant for which \h(x) | < c for all 
x G [a, b] . Prove that | f h\ < c (b — a) . 



5.3 Integrable Functions 3 

We now wish to extend the definition of the integral to a wider class of functions. This class will consist of 
those functions that are uniform limits of step functions. The requirement that these limits be uniform is 
crucial. Pointwise limits of step functions doesn't work, as we will see in Exercise 5.7 below. The initial step 
in carrying out this generalization is the following. 

Theorem 5.3: 

Let [a, b] be a closed bounded interval, and let {h n } be a sequence of step functions that converges 
uniformly to a function / on [a, b] . Then the sequence {/ h n } is a convergent sequence of real 
numbers. 
Proof: 

We will show that {J h n } is a Cauchy sequence in R. Thus, given an e > 0, choose an N such that 
for any n > N and any x € [a, b] , we have 

\f(x)-h n (x)\< £ (5.13) 

2 (0 — a) 

Then, for any m and n both > N and any x e [a, b] , we have 



Therefore, 



as desired. 



\h n 0) - h m (x) I < \h n 0) -f(x)\ + \f 0) - h m (x) I < -i-. (5.14) 

b — a 



I f K - f hm\ = I f \h n -h m )\< f \K - h m \ < f ^ = £, (5.15) 



The preceding theorem provides us with a perfectly good idea of how to define the integral of a function 
/ that is the uniform limit of a sequence of step functions. However, we first need to establish another kind 
of consistency result. 

Theorem 5.4: 

If {h n } and {k n } are two sequences of step functions on [a, b] , each converging uniformly to the 
same function /, then 



Urn I h n = Urn I k n . (5.16) 

Proof: 

Given e > 0, choose N so that if n > N, then \h n (x) — f (x) \ < ej (2 (b — a)) for all x € [a, b] , and 
such that |/ (x) — k n (x) \ < e/ (2 (b — a)) for all x e [a, b] . Then, \h n (x) — k n (x) \ < e/ (b — a) for 
all x £ [a, b] if n > N. So, 

\ f h n - f k n \< f\h n -k n \< f j-^-=e (5.17) 



3 This content is available online at <http://cnx.Org/content/m36209/l.2/>. 
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if n > N. Taking limits gives 

\lim h n — lim / k n \ < e. (5.18) 

Since this is true for arbitrary e > 0, it follows that lim J h n = lim J k n , as desired. 

Definition 5.5: 

Let [a, b] be a closed bounded interval of real numbers. A function / : [a, b] — » R is called integrable 
on [a, b] if it is the uniform limit of a sequence {h n } of step functions. 

Let / ([a, 6]) denote the set of all functions that are integrable on [a, b] . If / € J ([a, &]) , define 
the integral of /, denoted / /, by 

ff = limfh n , (5.19) 

where {/i n } is some (any) sequence of step functions that converges uniformly to / on [a, b] . 
As in the case of step functions, we use the following notations: 

f= f f= f f(t)dt. (5.20) 



5.4: 
REMARK Note that Theorem 5.4, p. 123 is crucial in order that this definition be unambiguous. 
Indeed, we will see below that this critical consistency result is one place where uniform limits of 
step functions works while pointwise limits do not. See parts (c) and (d) of Exercise 5.7. Note 
also that it follows from this definition that J f = 0, because J h = for any step function. In 
fact, we will derive almost everything about the integral of a general integrable function from the 
corresponding results about the integral of a step function. No surprise. This is the essence of 
mathematical analysis, approximation. 

Exercise 5.7 

Define a function / on the closed interval [0, 1] by / (x) = 1 if x is a rational number and / (x) = 
if x is an irrational number. 

a. Suppose ft is a step function on [0, 1] . Prove that there must exist an x € [0, 1] such that 
|/ (x) — h (x) | > 1/2. HINT: Let (xi-\, Xi) be an interval on which h is a constant c. Now use 
the fact that there are both rationals and irrationals in this interval. 

b. Prove that / is not the uniform limit of a sequence of step functions. That is, / is not an 
integrable function. 

c. Consider the two sequences {h n } and {k n } of step functions defined on the interval [0, 1] 
by h n = X(o.i/n)j an d k n = nx(o,i/n)- Show that both sequences {h n } and {k n } converge 
pointwise to the function on [0, 1] . HINT: All functions are at x = 0. For x > 0, choose 
N so that 1/N < x. Then, for any n > N,h n (x) = k n (x) = 0. 

d. Let h n and k n be as in part (c). Show that lim J h n = 0, but lim J k n = 1. Conclude that the 
consistency result in Theorem 5.4, p. 123 does not hold for pointwise limits of step functions. 

Exercise 5.8 

Define a function / on the closed interval [0, 1] by / (x) = x. 

a. For each positive integer n, let P n be the partition of [0, 1] given by the points {0 < 1/n < 
2/n < 3/n < ... < (n — 1) /n < 1}. Define a step function h n on [0, 1] by setting h n (x) = i/n 
if — - < x < -, and h n (i/n) = i/n for all < i < n. Prove that \f (x) — h n (x) | < 1/n for all 
x g [0, 1] , and then conclude that / is the uniform limit of the h n 's whence f € I ([0, 1]) . 
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b. Show that 

Ei n (n + 1) , 

i=l 

c. Show that f f (t) dt = 1/2. The next exercise establishes some additional properties of 
integrable functions on an interval [a, b] . 

Exercise 5.9 

Let [a, b] be a closed and bounded interval, and let / be an element of I ([a,b]) . 

a. Show that, for each e > there exists a step function h on [a, b] such that \f (x) — h (x) \ < e 
for all x € [a, b] . 

b. For each positive integer n let h n be a step function satisfying the conclusion of part (a) for 
e = l/n. Define k n = h n — l/n and l n = h n + l/n. Show that k n and l n are step functions, 
that k n (x) < f (x) < l n (x) for all x € [a, b] , and that \l n (x) — k n (x) \ = l n (x) — k n (x) = 2/n 
for all x. Hence, J (l n — k n ) = - (b — a) . 

c. Conclude from part (b) that, given any e > 0, there exist step functions k and I such that 
k{x) < f (x) < I (x) for which / (/ (x) - k (x)) < e. 

d. Prove that there exists a sequence {j n } of step functions on [a, b] , for which j n (x) < j n +i {%) < 
/ (x) for all x, that converges uniformly to /. Show also that there exists a sequence {j' n } of 
step functions on [a, b] , for which j n (x) > j n+1 (x) > f (x) for all x, that converges uniformly 
to /. That is, if / g i" ([o, b]) , then / is the uniform limit of a nondecreasing sequence of step 
functions and also is the uniform limit of a nonincreasing sequence of step functions. HINT: 
To construct the j„'s and j' n 's, use the step functions k n and /„ of part (b), and recall that 
the maximum and minimum of step functions is again a step function. 

e. Show that if / (x) > for all x s [a, b] , and g is defined by g (x) = \J f (x), then g £ I ([a,b]) . 
HINT: Write / = limh n where h n (x) > for all x and n. Then use part (g) of Exercise 3.28. 

f. (Riemann sums again.) Show that, given an e > 0, there exists a partition P such that if 
Q = {^o < Xi < ... < x n } is any partition finer than P, and {w{} are any points for which 
Wi G (xi-i,Xi) , then 

/b n 

f(t)dt-j2f o*) (^ - *i-i) i < e - ( 5 - 22 ) 

HINT: Let P be a partition for which both the step functions k and I of part (c) are constant 
on the open subintervals of P. Verify that for any finer partition Q,l (wi) > f (wi) > k (wi) , 
and hence 

^ I (m) (Xi - Xi-i) >^f (wi) (Xi - Xi-i) >^k (wi (xi - Xi-i) . (5.23) 

iii 

Definition 5.6: 

A bounded real- valued function / on a closed bounded interval [a, b] is called Riemann-integrable 
if, given any e > 0, there exist step functions k and I, on [a, 6] for which k (x) < f (x) < I (x) for all 
x, such that J (I — k) < e. We denote the set of all functions on [a, b] that are Riemann-integrable 
by I R ([a,b}). 

5.5: 
REMARK The notion of Riemann-integrability was introduced by Riemann in the mid nineteenth 
century and was the first formal definition of integrability. Since then several other definitions have 
been given for an integral, culminating in the theory of Lebesgue integration. The definition of 
integrability that we are using in this book is slightly different and less general from that of Riemann, 
and both of these are very different and less general from the definition given by Lebesgue in the 
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early twentieth century. Part (c) of Exercise 5.9 above shows that the functions we are calling 
integrable are necessarily Riemann-integrable. We will see in Exercise 5.10 that there are Riemann- 
integrable functions that are not integrable in our sense. In both cases, Riemann's and ours, an 
integrable function / must be trapped between two step functions k and I. In our definition, we 
must have I (x) — k (x) < e for all x € [a, b] , while in Riemann's definition, we only need that 
J I — k < e. The distinction is that a small step function must have a small integral, but it isn't 
necessary for a step function to be (uniformly) small in order for it to have a small integral. It only 
has to be small on most of the interval [a, b] . 

On the other hand, all the definitions of integrability on [a, b] include among the integrable 
functions the continuous ones. And, all the different definitions of integral give the same value to a 
continuous function. The differences then in these definitions shows up at the point of saying exactly 
which functions are integrable. Perhaps the most enlightening thing to say in this connection is 
that it is impossible to make a "good" definition of integrability in such a way that every function 
is integrable. Subtle points in set theory arise in such attempts, and many fascinating and deep 
mathematical ideas have come from them. However, we will stick with our definition, since it is 
simpler than Riemann's and is completely sufficient for our purposes. 

Theorem 5.5: 

Let [a, b] be a fixed closed and bounded interval, and let I([a,b\) denote the set of integrable 
functions on [a, b] . Then: 

1. Every element of / ([a, b]) is a bounded function. That is, integrable functions are necessarily 
bounded functions. 

2. I ([a, b}) is a vector space of functions. 

3. I ([a, b}) is closed under multiplication; i.e., if / and g & I ([a, b\) , then fg G I ([a, b}) . 

4. Every step function is in / ([a, b\) . 

5. If / is a continuous real-valued function on [a, b] , then / is in / ([a, 6]) . That is, every contin- 
uous real- valued function on [a, b] is integrable on [a, b] . 

Proof: 

Let / e I ([a, b}) , and write / = limh n , where {h n } is a sequence of step functions that converges 
uniformly to /. Given the positive number e = 1, choose N so that \f (x) — Iin (x) \ < 1 for all 
x g [a, b) . Then \f (x) | < |/ijv (x) | + 1 for all x € [a, b] . Because h^ is a step function, its range 
is a finite set, so that there exists a number M for which \Hn (%) I 5= M for all x € [a, b] . Hence, 
1/ (x) | < M + 1 for all x € [a, b] , and this proves part (1). 

Next, let / and g be integrable, and write / = limh n and g = limk n , where {h n } and {k n } 
are sequences of step functions that converge uniformly to / and g respectively. If s and t are real 
numbers, then the sequence {sh n + tk n } converges uniformly to the function sf + tg. See parts (c) 
and (d) of Exercise 3.28. Therefore, sf + tg € I ([a, b\) , and I ([a, b}) is a vector space, proving part 
(2). 

Note that part (3) does not follow immediately from Exercise 3.28; the product of uniformly 
convergent sequences may not be uniformly convergent. To see it for this case, let / = limh n and 
g = limkn be elements of / ([a, b\) . By part (1), both / and g are bounded, and we write Mf and 
M g for numbers that satisfy \f (x) | < Mf and \g (x) | < M g for all x € [a, b] . Because the sequence 
{k n } converges uniformly to g, there exists an N such that if n > N we have \g (x) — k n (x) | < 1 
for all x g [a, b] . This implies that, if n > N, then \k n (x) | < M g + 1 for all x € [a, b] . 

Now we show that fg is the uniform limit of the sequence h n k n . For, if n > N, then 

\f (x) g (x) - h n (x) k n (x) | = \f (x) g (x) - f (x) k„ (x) + / (x) k n (x) - h n (x) k n (x) | 

< |/(x)|| 5 (x)-fc n (x)| + |fc n (x)||/(x)-Mx)| ( 5 - 24 ) 

< M f \g (x) - k n (x) | + (M g + 1) |/ (x) - h n (x) |, 

which implies that fg = lira (h n k n ) • 
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If h is itself a step function, then it is obviously the uniform limit of the constant sequence {h}, 
which implies that h is integrable. 

Finally, if / is continuous on [a, 6] , it follows from Theorem 3.21, p. 78 that / is the uniform 
limit of a sequence of step functions, whence / £ / ([a, b\) . 

Exercise 5.10 

Let / be the function defined on [0, 1] by / (cc) = sin (l/x) if x ^ and / (0) = 0. 

a. Show that / is continuous at every nonzero x and discontinuous at 0. HINT: Observe that, 
on any interval (0,6) , the function sin (l/x) attains both the values 1 and — 1. 

b. Show that / is not integrable on [0, 1] . HINT: Suppose / = limh n . Choose N so that \f (x) — 
fojv (x) | < 1/2 for all x £ [0, 1] . Let P be a partition for which Hn is constant on its open 
subintervals, and examine the situation for a;'s in the interval (a;o,ari) . 

c. Show that /is Riemann- integrable on [0,1]. Conclude that I([a,b\) is a proper subset of 
I R ([a,b}). 

Exercise 5.11 

a. Let / be an integrable function on [a, b] . Suppose g is a function for which g (x) = f (x) for 
all x £ [a, b] except for one point a Prove that g is integrable and that J g = J f. HINT: If 
/ = limh n , define k n (x) = h n (x) for all x ^ c and k n (c) = g (c) . Then use Exercise 5.4. 

b. Again, let / be an integrable function on [a, b] . Suppose g is a function for which g (x) = / (x) 
for all but a finite number of points ci,...,Cjv € l a , b] ■ Prove that g £ I([a,b\), and that 

l9 = If- 

c. Suppose / is a function on the closed interval [a, b] , that is uniformly continuous on the 
open interval (a, b) . Prove that / is integrable on [a, b] . HINT: Just reproduce the proof to 
Theorem 3.21, p. 78. 

5.6: 

REMARK In view of part (b) of the preceding exercise, we see that whether a function / is 
integrable or not is totally independent of the values of the function at a fixed finite set of points. 
Indeed, the function needn't even be defined at a fixed finite set of points, and still it can be 
integrable. This observation is helpful in many instances, e.g., in parts (d) and (e) of Exercise 5.21. 

Theorem 5.6: 

The assignment / — ► J f on J ([a, b}) satisfies the following properties. 

1. (Linearity) I([a,b}) is a vector space, and J (af + f3g) = a J f + (3 J g for all f,g s 
I ([a, 6])and a,/3 £ R. 

2. (Positivity) If / (x) > for all x £ [a, b] , then / / > 0. 

3. (Order-preserving) If /, g £ I ([a, b]) and / (x) < g (x) for all x £ [a, b] , then J f < J g. 

4. If/e/([o,6]),thensois|/|,and|//|</|/|. ' 

5. If / is the uniform limit of functions /„, each of which is in I([a,b\) , then / £ I([a,b\) and 
// = limjfn. 

6. Let {u n } be a sequence of functions in / ([a, b}) . Suppose that for each n there is a number m n , 
for which \u n (x) \ < m n for all x £ [a,b] , and such that the infinite series ^ m n converges. 
Then the infinite series Yl u n converges uniformly to an integrable function, and J J2u n = 

Proof: 

That I([a,b\) is a vector space was proved in part (2) of Theorem 5.5, p. 126. Let / and g be in 
i" ([a, b\) , and write / = limh n and g = limk n , where the h n 's and the fc„'s are step functions. Then 
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a f + 09 = ^ rn { a h n + /3k n ) , so that, by Theorem 5.2, p. 121 and the definition of the integral, we 
have 

/ {af + (3g) = Urn J (ah n + (ik n ) 
= lim(ajh„ + f3jk„) 
= alim J h n + (Him J k n 
ajf + pfg, 

which proves part (1). 

Next, if / e / ([a, 6]) satisfies / (x) > for all x e [a, b] , let {l n } be a nonincreasing sequence of 
step functions that converges uniformly to /. See part (d) of Exercise 5.9. Then /„ (x) > f (x) > 
for all x and all n. So, again by Theorem 5.2, p. 121, we have that 



/ = lim l n > 0. (5.26) 

This proves part (2). 

Part (3) now follows by combining parts (1) and (2) just as in the proof of Theorem 5.2, p. 121. 
To see part (4), let / s I([a,b]) be given. Write / = limh n . Then |/| = lim\h n \. For 

\\f{x)\-\h n {x)\\<\f{x)-h n {x)\. (5.27) 

Therefore, |/| is integrable. Also, 



|/| = Urn / \h n \ > lim\ / h n \ = \lim K\ = \ f\. (5.28) 

To see part (5), let {/„} be a sequence of elements of / ([a, 6]) , and suppose that / = limf n . For 
each n, let h n be a step function on [a, b] such that |/„ (x) — h n (x) \ < \jn for all x € [a, b] . Note 
also that it follows from parts (3) and (4) that 

I ( In- / /i„|<—. (5.29) 



Now {hn} converges uniformly to /. For, 

\f(x)-h n (x)\ < \f(x)-f n (x)\ + \f n (x)-h n (x)\ 

showing that / = limh n . Therefore, / e I([a,b}). Moreover, J f = lim J h n . Finally, J f = 
limjf n , for 

\If-Ifn\ < \Jf-Jh n \ + \fh n -ff n \ 
< \Jf-fhn\+ b -^. 

This completes the proof of part (5). 

Part (6) follows directly from part (5) and the Weierstrass M Test (Theorem 3.19, Weierstrass 
M-Test, p. 77). For, part (1) of that theorem implies that the infinite series J2 u n converges 
uniformly, and then J J2 u n = J2 J u n follows from part (5) of this theorem. 

As a final extension of our notion of integral, we define the integral of certain complex- valued functions. 
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Definition 5.7: 

Let [a, b] be a fixed bounded and closed interval. A complex- valued function / = u + iv is called 
integrable if its real and imaginary parts u and v are integrable. In this case, we define 

b t>b />b t>b 

/=/ (u + iv)= u+i v. (5.32) 

J a J a J a J a 

Theorem 5.7: 

1. The set of all integrable complex- valued functions on [a, b] is a vector space over the field of 
complex numbers, and 

cb t'b t'b 

(af + (3g) = a f + (3 g (5.33) 

J a J a 

for all integrable complex-valued functions / and g and all complex numbers a and f3. 

2. If / is an integrable complex-valued function on [a, b] , then so is |/|, and \ J f\ < J \f\. 

Proof: 

We leave the verification of part (1) to the exercise that follows. 



To see part (2), suppose that / is integrable, and write / = u + iv. Then |/| = y/u 2 + v 2 , so that 
|/| is integrable by Theorem 5.5, p. 126 and part (e) of Exercise 5.9. Now write z = J f, and write 
z in polar coordinates as z = re 10 , where r = \z\ = \ J f\. (See Exercise 4.23 (Polar coordinates).) 
Define a function g by g (x) = e~ t6 f (x) and notice that \g\ = \f\. Then J g = e~ t6 J f = r, which 

is a real number. Writing g =u +i v, we then have that r = J u +i J v , implying that J v= 0. So, 



I/. 6 /I 



r b 
J a 9 

r b " 
U 
Ja 



= 


l/> 


< 


/> 


< 


la \9\ 


= 


! b a\f\ 



(5.34) 



as desired. 

Exercise 5.12 

Prove part (1) of the preceding theorem. 

HINT: Break a, /?, J f, and J g into real and imaginary parts. 



5.4 The Fundamental Theorem of Calculus 4 

We begin this section with a result that is certainly not a surprise, but we will need it at various places in 
later proofs, so it's good to state it precisely now. 



This content is available online at <http://cnx.Org/content/m36210/l.2/>. 
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Theorem 5.8: 

Suppose / e I {[a, b]) , and suppose a < c < b. Then / e I ([a, c]) ,/ € / ([c, 6]) , and 

/= / /+ / /• (5-35) 



Proof: 

Suppose first that h is a step function on [a, b] , and let P = {xo < x\ < ... < x n } be a partition 
of [a, b] such that ft, (ir) = o, on the subinterval (a^j_i,a;j) of P. Of course, we may assume without 
loss of generality that c is one of the points of P, say c = Xk- Clearly h is a step function on both 
intervals [a, c] and [c, 6] . 

Now, let Qi = {a = xo < xi < ... < c = Xk} be the partition of [a,c] obtained by intersecting 
P with [a,c\ , and let Q2 = {c = Xk < Xk+i < ... < x n = b] be the partition of [c,b] obtained by 
intersecting P with [c, 6] . We have that 

X> = s p( h ) 

= Y,i=l a i{ x i - x i-l) +YJi=k+l a ii X i ~ X i-l) (5.36) 

5 Ql (h) + S Q2 (h) 

I> + J b c h, 

which proves the theorem for step functions. 

Now, write / = limh n , where each h n is a step function on [a, 6] . Then clearly / = limh n on 
[a, c] , which shows that / s I ([a,c\) , and 



/ = lira I h n . (5.37) 

J a 

Similarly, / = limh n on [c, b] , showing that f £ I ([c,b]) , and 



b rb 

f = lim h n - (5.38) 



Finally, 



llf = UmJ h a h n 



\ b ' (5-39) 

Urn / Q c h n + lira j c h n 



as desired. 



I's time for the trumpets again! What we call the Fundamental Theorem of Calculus was discovered 
by Newton and Leibniz more or less simultaneously in the seventeenth century, and it is without doubt 
the cornerstone of all we call mathematical analysis today. Perhaps the main theoretical consequence of 
this theorem is that it provides a procedure for inventing "new" functions. Polynomials are rather natural 
functions, power series are a simple generalization of polynomials, and then what? It all came down to 
thinking of a function of a variable x as being the area beneath a curve between a fixed point a and the 
varying point x. By now, we have polished and massaged these ideas into a careful, detailed development 
of the subject, which has substantially obscured the original ingenious insights of Newton and Leibniz. On 
the other hand, our development and proofs are complete, while theirs were based heavily on their intuition. 
So, here it is. 
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Theorem 5.9: Fundamental Theorem of Calculus 

Suppose / is an arbitrary element of I ([a,b]) . Define a function F on [a, b] by F (x) = J f. Then: 

1. F is continuous on [a, b] , and F (a) = 0. 

2. If / is continuous at a point c € (a, b) , then F is different iable at c and F' (c) = / (c) . 

3. Suppose that / is continuous on [a, b] . If G is any continuous function on [a, b] that is differ- 
entiable on (a, b) and satisfies G' (x) = f (x) for all x € (a, b) , then 

b 
f{t)dt=G{b)-G{a). (5.40) 

REMARK Part (2) of this theorem is the heart of it, the great discovery of Newton and Leibniz, 
although most beginning calculus students often think of part (3) as the main statement. Of course 
it is that third part that enables us to actually compute integrals. 
Proof: 

Because / e I([a,b}) , we know that / e I([a,x}) for every x € [a,b] , so that F (x) at least is 
defined. 

Also, we know that / is bounded; i.e., there exists an M such that \f (t) \ < M for all t € [a, b] . 
Then, if x, y € [a, b] with x > y, we have that 

\F(x)-F(y) 

= \I V af + S X v f-S V a f\ 

i r f\ 

(5.41) 

M{x-y), 

so that \F (x) - F(y)\ < M\x — y\ < s if \x — y\ < S = e/M. This shows that F is (uniformly) 
continuous on [a, 6] . Obviously, F (a) = J / = 0, and part (1) is proved. 

Next, suppose that / is continuous at c € (a, b) , and write L = f (c) . Let e > be given. 
To show that F is different iable at c and that F' (c) = /(c), we must find a S > such that if 
< \h\ < 5 then 

,F(c+h)-F(c) , 

— ' —-L\<£. 5.42 

h 

Since / is continuous at c, choose 5 > so that \f (t) — f (c) | < e if \t — c\ < S. Now, assuming 
that h > for the moment, we have that 

F(c+h)-F(c) = s: +h f-r a f 

= JZf+Cf-Kf ( 5 - 43 ) 



= 


i /;/-/: /i 


= 


i/. y /+/;/-/ vi 


= 


i/;/i 


< 


/; i/i 


< 


Tm 



and 

rc+h 



jc+n L 



(5.44) 
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So, if < h < 6, then 



F(c+h)-F(c) 


~L\ 


= 


, /;+" m dt /;+» l 


h 


1 h h 








,/;+"(/(*)-!,) dt. 








1 h 1 






< 


/ c c+h |/(i)-£| dt 








h 








s: +h \f(t)-f(c)\dt 








h 






< 


s: +h * 








h 






= 


£, 



(5.45) 



where the last inequality follows because for t € [c, c + h] , we have that \t — c\ < h < 5. A similar 
argument holds if h < 0. (See the following exercise.) This proves part (2). 

Suppose finally that G is continuous on [a, b] , differentiable on (a, b) , and that G' (x) = f (x) 
for all x e (a, b) . Then, F — G is continuous on [a, b] , differentiable on (a, b) , and by part (2) 
(F - G) (x) = F' (ar)-G' (a;) = / (x)-f (x) = for all x € {a, b) . It then follows from Exercise 4.12 
that F — G is a constant function C, whence, 

G(b)-G{a) = F(b) + C-F(a)-C = F(b)= f f (t) dt, (5.46) 

J a 

and the theorem is proved. 

Exercise 5.13 

a. Complete the proof of part (2) of the preceding theorem; i.e., take care of the case when 
h< 0. HINT: In this case, a<c+h<c. Then, write / Q c / = J^ +h f + J^ +h f. 

b. Suppose / is a continuous function on the closed interval [a, b] , and that /' exists and is 
continuous on the open interval (a, b) . Assume further that /' is integrable on the closed 
interval [a, b] . Prove that / (x) — f (a) = J f for all x € [a, b] . Be careful to understand how 
this is different from the Fundamental Theorem. 

c. Use the Fundamental Theorem to prove that for x > 1 we have 

f x 1 
ln(x) =F(x)= / -dt, (5.47) 

and for < x < 1 we have 

ln{x) = F{x) = - -dt. (5.48) 

J X * 

HINT: Show that these two functions have the same derivative and agree at x = 1. 



5.5 Consequences of the Fundamental Theorem 5 

The first two theorems of this section constitute the basic "techniques of integration" taught in a calculus 
course. However, the careful formulations of these standard methods of evaluating integrals have some subtle 
points, i.e., some hypotheses. Calculus students are rarely told about these details. 



5 This content is available online at <http://cnx.Org/content/m36211/l.2/>. 
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Theorem 5.10: Integration by Parts Formula 

Let / and g be integrable functions on [a, b] , and as usual let F and G denote the functions defined 

by 



Then 



F(x) = f, andG(x) = / g. (5.49) 

J a J a 



b /•& 

fG=[F(b)G(b)-F(a)G(a)}- Fg. (5.50) 



Or, recalling that / = F and g = G 

,-b ,-b 



/>0 />0 

/ F'G=[F{b)G{b)-F{a)G{a)}- FG' . (5.51) 

•J a J a 



Exercise 5.14 



a. Prove the preceding theorem. HINT: Replace the upper limit b by a variable x, and differen- 
tiate both sides. By the way, how do we know that the functions Fg and fG are integrable? 

b. Suppose / and g are integrable functions on [a, b] and that both /' and g are continuous on 
(a, b) and integrable on [a, b] . (Of course /' and g are not even defined at the endpoints a and 
b, but they can still be integrable on [a, b] . See the remark following Exercise 5.11.) Prove 
that 

" fg = [f (&) g(b)-f (a) g (a)} - f fg. (5.52) 

•J a 

Theorem 5.11: Integration by Substitution 

Let / be a continuous function on [a, b] , and suppose g is a continuous, one-to-one function from 
[c, d] onto [a, b] such that g is continuously different iable on (c, d) , and such that a = g (c) and 
b = g (d) . Assume finally that g is integrable on [c, d] . Then 

I f(t)dt= I f(g(s))g'(s)ds. (5.53) 

J a J c 

Proof: 

It follows from our assumptions that the function / (g (s)) g (s) is continuous on (a, b) and inte- 
grable on [c, d] . It also follows from our assumptions that g maps the open interval (c, d) onto the 
open interval (a, b) . As usual, let F denote the function on [a, 6] defined by F (x) = J f (t) dt. 
Then, by part (2) of the Fundamental Theorem, F is differentiable on (a, b) , and F' = f. Then, by 
the chain rule, F o g is continuous and differentiable on (c, d) and 

(F o g) ( 8 ) = F' (g («)) g (s) = f (g (a)) g (a) . (5.54) 

So, by part (3) of the Fundamental Theorem, we have that 

j'figisDg^ds = J c d (F o g) (a) ds 

= (F o g) (d) - (F o g) ( C ) 

= F(g(d))-F(g(c)) (5.55) 

F(b)-F(a) 
I b a f(t)dt, 
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which finishes the proof. 

Exercise 5.15 

a. Prove the "Mean Value Theorem" for integrals: If / is continuous on [a, b] , then there exists 
a c g (a, b) such that 

l-b 

f(t)dt = f(c)(b-a). (5.56) 

(Uniform limits of differentiable functions. Compare with Exercise 4.26.) Suppose {/ n } is 
a sequence of continuous functions on a closed interval [a, b] that converges pointwise to 
a function /. Suppose that each derivative f' n is continuous on the open interval (a, b) , is 
integrable on the closed interval [a, b] , and that the sequence {f' n } converges uniformly to 
a function g on (a, b) . Prove that / is differentiable on (a, b) , and /' = g. HINT: Let x be 
in (a, b) , and let c be in the interval (a, x) . Justify the following equalities, and use them 
together with the Fundamental Theorem to make the proof. 



f (x) - f (c) = lim (f„ (x) - f n (c)) = Urn f„= 9- (5.57) 

J c J c 

We revisit now the Remainder Theorem of Taylor, which we first presented in Theorem 4.19, Taylor's 
Remainder Theorem, p. 107. The point is that there is another form of this theorem, the integral form, and 
this version is more powerful in some instances than the original one, e.g., in the general Binomial Theorem 
below. 

Theorem 5.12: Integral Form of Taylor's Remainder Theorem 

Let c be a real number, and let / have n + 1 derivatives on (c — r,c+ r) , and suppose that 
/(™ +1 ) e I ([c — r,c + r]) . Then for each c < x < c + r, 

f (x) - T? M (x) = f /("+ 1 ) (t) fc^ dt, (5.58) 



where TJ denotes the nth Taylor polynomial for /. 
Similarly, for c — r < x < c, 



f (x) - T ( " /iC) (x) = f /(" +1 ) (t) { ^-f- dt. (5.59) 

Exercise 5.16 

Prove the preceding theorem. 

HINT: Argue by induction on n, and integrate by parts. 

5.7: 

REMARK We return now to the general Binomial Theorem, first studied in Theorem 4.21, p. 
110. The proof given there used the derivative form of Taylor's remainder Theorem, but we were 
only able to prove the Binomial Theorem for \t\ < 1/2. The theorem below uses the integral form 
of Taylor's Remainder Theorem in its proof, and it gives the full binomial theorem, i.e., for all t for 
which |£| < 1. 

Theorem 5.13: General Binomial Theorem 
Let a = a + bi be a fixed complex number. Then 

oo 



fc=0 
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for all £G (-1,1). 
Proof: 

For clarity, we repeat some of the proof of Theorem 4.21, p. 110. Given a general a = a + bi, 
consider the function g : ( — 1, 1) — » C defined by g (t) = (1 + t) a . Observe that the nth derivative 
of g is given by 

g w( t) = a(a -l ) "i;-" +1) . (5.6D 



a + *r 



Then g G C°° ((-1, 1)) . 

For each nonnegative integer k define 



iu\ , - , . a (a — I) ... (a — k + I) / a\ 
a k = 9 ik) (0) /kl = -± '—£ " = (*)' ( 5 - 62 ) 

and set h (t) = J2T=o a kt k ■ The radius of convergence for the power series function h is 1, as was 
shown in Exercise 4.31. We wish to show that g (t) = h (i) for all — 1 < t < 1. That is, we wish to 
show that g is a Taylor series function around 0. It will suffice to show that the sequence {S n } of 
partial sums of the power series function h converges to the function g. We note also that the nth 
partial sum is just the nth Taylor polynomial T™ for g. 

Now, fix a t strictly between and 1. The argument for i's between —1 and is completely 
analogous.. Choose an e > for which (3 = (l + e)t < 1. We let C e be a numbers such that 
I (") I ^ Ce(l + £ ) n f° r a ll nonnegative integers n. See Exercise 4.31. We will also need the following 
estimate, which can be easily deduced as a calculus exercise (See part (d) of Exercise 4.11.). For 
all s between and t, we have (t — s) / (1 + s) < t. Note also that, for any s G (0,i) , we have 
| (1 + s) a \ = (1 + s) a , and this is trapped between 1 and (1 + t) a . Hence, there exists a number M t 
such that | (1 + s) a | < M t for all s G (—0, t) . We will need this estimate in the calculation that 
follows. 

Then, by the integral form of Taylor's Remainder Theorem, we have: 

ls(*)-£Lo«fc* fc l = \g(t)-T-(t)\ 

\Io9 {n+1) (s)^ L ds\ 

= l/o( (T ^i. ) i Xa )(l + *) a " n_1 (t-«) n d«l 

< /oi(„:i)ii(i+-r 1 i(-+i)i(f+fr^ (563) 

< Jo \ (nil) \ M t(n+ l) t n d S 

< C E M t {n+l)f*{l + e) n+1 t n ds 

C e M t {n+l){l + e) n+1 t n+1 
C e M t {n+l)(3 n+1 , 
which tends to as n goes to oo, because (3 < 1. This completes the proof for < t < 1. 



5.6 Area of Regions in the Plane 6 

It would be desirable to be able to assign to each subset S of the Cartesian plane R 2 a nonnegative real 
number A (S) called its area. We would insist based on our intuition that (i) if S is a rectangle with sides of 
length L and W then the number A (S) should be LW, so that this abstract notion of area would generalize 
our intuitively fundamental one. We would also insist that (ii) if S were the union of two disjoint parts, 



6 This content is available online at <http://cnx.Org/content/m36212/l.2/>. 
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S = S\ U £2, then A (S) should be A (Si) + A (S2) ■ (We were taught in high school plane geometry that 
the whole is the sum of its parts.) In fact, even if S were the union of an infinite number of disjoint parts, 
S = U™ =1 S n with Si n Sj ■ = if i / j, we would insist that (iii) A (S) = Y,n=i A ( S n) ■ 

The search for such a definition of area for every subset of R 2 motivated much of modern mathematics. 
Whether or not such an assignment exists is intimately related to subtle questions in basic set theory, e.g., 
the Axiom of Choice and the Continuum Hypothesis. Most mathematical analysts assume that the Axiom 
of Choice holds, and as a result of that assumption, it has been shown that there can be no assignment 
S — > A (S) satisfying the above three requirements. Conversely, if one does not assume that the Axiom of 
Choice holds, then it has also been shown that it is perfectly consistent to assume as a basic axiom that such 
an assignment S — » A (S) does exist. We will not pursue these subtle points here, leaving them to a course 
in Set Theory or Measure Theory. However, Here's a statement of the Axiom of Choice, and we invite the 
reader to think about how reasonable it seems. 

5.8: 

AXIOM OF CHOICE Let S be a collection of sets. Then there exists a set A that contains 
exactly one element out of each of the sets S in S. 

The difficulty mathematicians encountered in trying to define area turned out to be involved with defining 
A(S) for every subset S s R 2 . To avoid this difficulty, we will restrict our attention here to certain " 
reasonable" subsets S. Of course, we certainly want these sets to include the rectangles and all other common 
geometric sets. 

Definition 5.8: 

By a (open) rectangle we will mean a set R = (0,6) x (c,d) in R 2 . That is, R = {(x,y) : a < 
x < b and c < y < d}. The analogous definition of a closed rectangle[a, b] x [c,d] should be clear: 
[a, b] x [c, d] = {(x, y) : a < x < b, c < y < d} . 

By the area of a (open or closed) rectangle R = (a, b) x (c, d) or [a, b] x [c, d] we mean the number 
A (R) = (b - a) (d - c) . . 
The fundamental notion behind our definition of the area of a set S is this. If an open rectangle R = 
(a,b) x (c,d) is a subset of S, then the area A(S) surely should be greater than or equal to A (R) = 
(b — a) (d — c) . And, if S contains the disjoint union of several open rectangles, then the area of S should be 
greater than or equal to the sum of their areas. 

We now specify precisely for which sets we will define the area. Let [a, b] be a fixed closed bounded 
interval in R and let I and u be two continuous real-valued functions on [a, b] for which I (x) < u (x) for all 
x e (a, b) . 

Definition 5.9: 

Given [a, b] , I, and u as in the above, let S be the set of all pairs (x, y) G R 2 , for which a < x < b 
and I (x) < y < u (x) . Then S is called an open geometric set. If we replace the < signs with < 
signs, i.e., if S is the set of all (x, y) such that a < x < b, and I (x) < y < u (x) , then S is called a 
closed geometric set. In either case, we say that S is bounded on the left and right by the vertical 
line segments {(a, y) : 1(a) < y < u (a)} and {(b,y) : 1(b) < y < u (&)}, and it is bounded below by 
the graph of the function I and bounded above by the graph of the function u. We call the union 
of these four bounding curves the boundary of S, and denote it by Cs- 

If the bounding functions u and I of a geometric set S are smooth or piecewise smooth functions, 
we will call S a smooth or piecewise smooth geometric set. 

If S is a closed geometric set, we will indicate the corresponding open geometric set by the symbol S°. 

The symbol 5° we have introduced for the open geometric set corresponding to a closed one is the same 
symbol that we have used previously for the interior of a set. Study the exercise that follows to see that the 
two uses of this notation agree. 

Exercise 5.17 

a. Show that rectangles, triangles, and circles are geometric sets. What in fact is the definition 
of a circle? 
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b. Find some examples of sets that are not geometric sets. Think about a horseshoe on its side, 
or a heart on its side. 

c. Let / be a continuous, nonnegative function on [a, b] . Show that the "region" under the graph 
of / is a geometric set. 

d. Show that the intersection of two geometric sets is a geometric set. Describe the left, right, 
upper, and lower boundaries of the intersection. Prove that the interior (Si l~l 52) of the 
intersection of two geometric sets Si and 52 coincides with the intersection S® n S® °f their 
two interiors. 

e. Give an example to show that the union of two geometric sets need not be a geometric set. 

f. Show that every closed geometric set is compact. 

g. Let S be a closed geometric set. Show that the corresponding open geometric set S° coincides 
with the interior of S, i.e., the set of all points in the interior of S. HINT: Suppose a < x < b 
and I (x) < y < u (x) . Begin by showing that, because both I and u are continuous, there must 
exist an e > and a 6 > such that a < x — S < x + S < b and I (x) < y — e < y + e < u (x) . 

Now, given a geometric set S (either open or closed), that is determined by an interval [a, b] and two bounding 
functions u and I, let P = {xq < xi < ... < x n } be a partition of [a, 6] . For each 1 < i < n, define numbers 
Ci and di as follows: 

Ci = sup I (x) , and di = inf u (x) . (5.64) 

Xi-i<X<Xi Xi-^<X<Xi 

Because the functions I and u are continuous, they are necessarily bounded, so that the supremum and 
infimum above are real numbers. For each 1 < i < n define Ri to be the open rectangle (xi-\,Xj) x (ci,dj) . 
Of course, di may be < Cj, in which case the rectangle Ri is the empty set. In any event, we see that the 
partition P determines a finite set of (possibly empty) rectangles {Ri}, and we denote the union of these 
rectangles by the symbol Cp. = U™ =1 (xj_i, Xi) x (a, di) . 

The area of the rectangle Ri is (xi — Xi-i) (di — Cj) if Cj < di and otherwise. We may write in general 
that A (Ri) = (xi — Xi-i) max ((di — Cj) , 0) . Define the number Ap by 

n 

A P = ^ (xt - Xi-i) (di - c^ . (5.65) 

i=i 
Note that Ap is not exactly the sum of the areas of the rectangles determined by P because it may happen 
that di < Ci for some i's, so that those terms in the sum would be negative. In any case, it is clear that Ap 
is less than or equal to the sum of the areas of the rectangles, and this notation simplifies matters later. 
For any partition P, we have S 3 Cp, so that, if A (S) is to denote the area of S, we want to have 

MS) > T,7=iMRi) 

= Tl l i = i{ x t- x i-i) max {{d l -c i ),Q) 

> S"=l ( X * ~ Zi-l) (^ ~ C i) 



A P . 

Definition 5.10: 

Let S be a geometric set (either open or closed), bounded on the left by x = a, on the right by 
x = 6, below by the graph of I, and above by the graph of u. Define the areaA (S) of S by 

n 

A (S) = supAp = sup ^2 ( x i ~ x i-i) (di - Ci) , (5.67) 

P P={x„<x 1 <...<x n } i=1 

where the supremum is taken over all partitions P of [a, b] , and where the numbers Cj and di are 
as defined above. 

Exercise 5.18 
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a. Using the notation of the preceding paragraphs, show that each rectangle Ri is a subset of 
the set S and that Ri n Rj = if i / j. It may help to draw a picture of the set S and the 
rectangles {Ri}. Can you draw one so that di < c{l 

b. Suppose S\ is a geometric set and that S2 is another geometric set that is contained in Si. 
Prove that A (S2) < A (Si) . HINT: For each partition P, compare the two Ap's. 

Exercise 5.19 

Let T be the triangle in the plane with vertices at the three points (0, 0) , (0, H) , and (B, 0) . Show 
that the area A (T) , as defined above, agrees with the formula A = (1/2) BH, where B is the base 
and H is the height. 

The next theorem gives the connection between area (geometry) and integration (analysis). In fact, this 
theorem is what most calculus students think integration is all about. 

Theorem 5.14: 

Let S be a geometric set, i.e., a subset of R 2 that is determined in the above manner by a closed 
bounded interval [a, b] and two bounding functions I and u. Then 

A(S) = j (u(x)-l(x)) dx. (5.68) 

Proof: 

Let P = {xq < x\ < ... < x n } be a partition of [a, b] , and let Cj and di be defined as above. Let h 
be a step function that equals di on the open interval [xi-\,Xi) , and let k be a step function that 
equals q on the open interval (xi-i, X{) . Then on each open interval (x^-i, Xi) we have h (x) < u [x) 
and k (x) > I (x) . Complete the definitions of h and k by defining them at the partition points so 
that h (x^ = k (x^ for all i. Then we have that h (x) — k (x) < u (x) — I (x) for all x € [a, b] . Hence, 

A P = J2^i-Xi-i)(di-Ci)= {h-k)< (u-l). (5.69) 

Since this is true for every partition P of [a, b] , it follows by taking the supremum over all partitions 
P that 



A(S) = supAp < / (u(x)-l(x)) 

P Ja 



dx, (5.70) 



which proves half of the theorem; i.e., that A (S) < J u— I. 

To see the other inequality, let h be any step function on [a, b] for which h (x) < u (x) for all x, 
and let k be any step function for which k (x) > I (x) for all x. Let P = {xo < x\ < ... < x n } be 
a partition of [a,b] for which both h and k are constant on the open subintervals (a;,_i,cCj) of P. 
Let a\, a2, ••-, a n and 61, 62, •••, b n be the numbers such that h (x) = di on (a;,_i, Xi) and k (x) = bi 
on (xi_i,Xi) . It follows, since h(x) < u(x) for all x, that Oj < di. Also, it follows that 6, > Cj. 
Therefore, 

/f) n n 

(h-k) = Y J (o» - bi) (xi - Xi-i) <Y,( x i~ x *-i) K -Ci) = A P <A (S) . (5.71) 

»=i »=i 

Finally, let {h m } be a nondecreasing sequence of step functions that converges uniformly to u, and 
let {km] be a nonincreasing sequence of step functions that converges uniformly to I. See part (d) 
of Exercise 5.9. Then 

6 ,-b 

(u-l) = lim (h m -k m )<A(S), (5.72) 
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which proves the other half of the theorem. 

OK! Trumpet fanfares, please! 

Theorem 5.15: 

(A = irr 2 .) If S is a circle in the plane having radius r, then the area A (S) of S is irr 2 . 
Proof: 

Suppose the center of the circle S is the point (h, k) . This circle is a geometric set. In fact, we may 
describe the circle with center (h, k) and radius r as the subset S of R 2 determined by the closed 
bounded interval [h — r,h + r] and the functions 



u(x) = k+ ^Jr 2 - (x - hf (5.73) 

and 



l( x ) = k- \Jr 2 -{x-h). (5.74) 

By the preceding theorem, we then have that 



i-h+r 

A(S)= I 2\/r 2 - {x - h) z dx = Trr 2 . (5.75) 

Jh-r 



We leave the verification of the last equality to the following exercise. 

Exercise 5.20 

Evaluate the integral in the above proof: 

h+r 



2\Jr 2 - {x-hfdx. (5.76) 

' h— r 

Be careful to explain each step by referring to theorems and exercises in this book. It may seem 
like an elementary calculus exercise, but we are justifying each step here. 

5.9: 
REMARK There is another formula for the area of a geometric set that is sometimes very useful. 
This formula gives the area in terms of a "double integral." There is really nothing new to this 
formula; it simply makes use of the fact that the number (length) u (x) — I (x) can be represented 
as the integral from I (x) to u (x) of the constant 1. Here's the formula: 

r b ( r u ( x ) \ 
A{S)= / 1 dy \ dx. (5.77) 

The next theorem is a result that justifies our definition of area by verifying that the whole is equal to 
the sum of its parts, something that any good definition of area should satisfy. 

Theorem 5.16: 

Let S be a closed geometric set, and suppose S = U™_ 1 5'i, where the sets {Si} are closed geometric 
sets for which Sf n 5° = if i ^ j. Then 



A(S)=J2A(Si). (5.78) 



Proof: 

Suppose S is determined by the interval [a, b] and the two bounding functions I and u, and suppose 
Si is determined by the interval [aj, 6j] and the two bounding functions U and itj. Because Si C S, it 
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must be that the interval [ejj, 6j] is contained in the interval [a, 6] . Initially, the bounding functions k 
and Ui are defined and continuous on [a,, 6^] , and we extend their domain to all of [a, b] by defining 
li (x) = Ui (x) = for all x s [a, b] that are not in [aj,6j] . The extended functions li and Ui may 
not be continuous on all of [a, b] , but they are still integrable on [a, b] . (Why?) Notice that we now 
have the formula 

A (Si) = / ( Ui (x) - k (x)) dx = (Ui (x) - h (x)) dx. (5.79) 

Next, fix an x in the open interval (a, b) . We must have that the vertical intervals (li (x) , w, (x)) 
and (lj (x) ,Uj (x)) are disjoint if i / j. Otherwise, there would exist a point y in both intervals, 
and this would mean that the point (x,y) would belong to both S® and S?, which is impossible 
by hypothesis. Therefore, for each x € (a, b) , the intervals {(li () x) ,Ui (x)} are pairwise disjoint 
open intervals, and they are all contained in the interval (I (x) , u (x)) , because the Si's are subsets 
of S. Hence, the sum of the lengths of the open intervals {(li (x) ,Ui (x))} is less than or equal to 
the length of (I (x) , u (x)) . Also, for any point y in the closed interval [I (x) , u (x)] , the point (x, y) 
must belong to one of the Si's, implying that y is in the closed interval [li (x) ,Ui (x)} for some i. 
But this means that the sum of the lengths of the closed intervals [li (x) , Ui (x)] is greater than or 
equal to the length of the interval [I (x) , u (x)] . Since open intervals and closed intervals have the 
same length, we then see that (u (x) — I (x) = J27=i ( u i ( x ) ~ h ( x )) • 
We now have the following calculation: 

ELi A ( s *) = ELi LI ( M * ( x ) _ l i ( a: )) dx 

= ELl Ja (Ui(x)-li(x)) dx 

= f b a ELAu t (x)-k( x ))dx (5.80) 

la ( U ( X )- l ( x )) dx 

A(S), 
which completes the proof. 



5.7 Extending the Definition of Integrability 7 

We now wish to extend the definition of the integral to a wider class of functions, namely to some that are 
unbounded and Others whose domains are not closed and bounded intervals. This extended definition is 
somewhat ad hoc, and these integrals are sometimes called "improper integrals." 

Definition 5.11: 

Let / be a real or complex- valued function on the open interval (a, b) where a is possibly — oo and 
b is possibly +oo. We say that / is improperly-integmble on (a, b) if it is integrable on each closed 
and bounded subinterval [a , b'] C (a, b) , and for each point c € (a, b) we have that the two limits 

limb — > b — J f and lim a <^ a+0 J , f exist. 

More generally, We say that a real or complex-valued function /, not necessarily defined on all 
of the open interval (a, b) , is improperly-integmble on (a, b) if there exists a partition {xi} of [a, b] 
such that / is defined and improperly-integrable on each open interval (xi_i,Xi) . 

We denote the set of all functions / that are improperly-integrable on an open interval (a, b) by 
h((a,b)). 
Analogous definitions are made for a function's being integrable on half-open intervals [a, b) and (a, b] . 



7 This content is available online at <http://cnx.Org/content/m36222/l.2/>. 
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Note that, in order for / to be improperly- integrable on an open interval, we only require / to be defined 
at almost all the points of the interval, i.e., at every point except the endpoints of some partition. 

Exercise 5.21 

a. Let / be defined and improperly-integrable on the open interval (a, b) . Show that 
Um a - ^o+o / ' / + Km(,%5_o / /is the same for all c G (a, b) . 

— 1/2 

b. Define a function / on (0, 1) by / (x) = (1 — x) . Show that / is improperly-integrable on 
(0, 1) and that / is not bounded. (Compare this with part (1) of Theorem 5.5, p. 126.) 

c. Define a function g on (0, 1) by g (x) = (1 — x)~ . Show that g is not improperly-integrable 
on (0, 1) , and, using part (b), conclude that the product of improperly-integrable functions on 
(0, 1) need not itself be improperly-integrable. (Compare this with part (3) of Theorem 5.5, 
p. 126.) 

d. Define h to be the function on (0, oo) given by h (x) = 1 for all x. Show that h is not 
improperly-integrable on (0,oo). (Compare this with parts (4) and (5) of Theorem 5.5, p. 
126.) 

Part (a) of the preceding exercise is just the consistency condition we need in order to make a definition of 
the integral of an improperly-integrable function over an open interval. 

Definition 5.12: 

Let / be defined and improperly-integrable on an open interval (a, b) . We define the integral of / 
over the interval (a, b) , and denote it by J f, by 

6 re r b' 



f = Urn f+ Urn f. (5.81) 

a'^a+0 7 a ' b'-*b-0J c 

In general, if / is improperly-integrable over an open interval, i.e., / is defined and improperly- 
integrable over each subinterval of (a, b) determined by a partition {xi}, then we define the integral 
of / over the interval (a, b) by 



/b n fXi 



f. (5.82) 



Theorem 5.17: 

Let (a, b) be a fixed open interval (with a possibly equal to — oo and b possibly equal to +oo, and 
let Ii ((o, b)) denote the set of improperly-integrable functions on (a, b) . Then: 

1. U ((a, b)) is a vector space of functions. 

2. (Linearity) f* (a/ + j3g) = a f* f + (3 f* g for all f,g€h ((a, 6))and a,f3eC. 

3. (Positivity) If / (x) > for all x € {a, b) , then | Q 6 / > 0. 

4. (Order-preserving) If /, g € h ((a, b)) and / (x) < g (x) for all x € (a, b) , then J f < J g. 

Exercise 5.22 

a. Use Theorem 5.5, p. 126, Theorem 5.6, p. 127, Theorem 5.7, p. 129, and properties of limits 
to prove the preceding theorem. 

b. Let / be defined and improperly-integrable on (a, b) . Show that, given an e > 0, there exists 

a S > such that for any a < a < a + 5 and any b — S < b' < b we have \ J f\ + \ L f\ < s. 

c. Let / be improperly-integrable on an open interval (a, b) . Show that, given an e > 0, there 
exists a S > such that if (c, d) is any open subinterval of (a, 6) for which d — c < S, then 
| / f\ < e. HINT: Let {xi} be a partition of [a, b] such that / is defined and improperly- 
integrable on each subinterval (xi-\,Xi) . For each i, choose a Si using part (b). Now / is 
bounded by M on all the intervals [ccj_i + Si, Xi — Si] , so S = e/M should work there. 
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d. Suppose / is a continuous function on a closed bounded interval [a, b] and is continuously 
differentiable on the open interval (a, b) . Prove that /' is improperly-integrable on (a, b) , and 
evaluate J f . HINT: Fix a point c G (a, b) , and use the Fundamental Theorem of Calculus 
to show that the two limits exist. 

e. (Integration by substitution again.) Let g : [c, d] — > [a, b] be continuous on [c, d] and satisfy 
g (c) = a and g (d) = b. Suppose there exists a partition {xo < x\ < ... < x n } of the interval 
[c,d] such that g is continuously differentiable on each subinterval (xi-i,Xi) . Prove that g' is 
improperly-integrable on the open interval (c, d) . Show also that if / is continuous on [a, b] , 
we have that 

I f(t)dt= I f(g(s))g'(s)ds. (5.83) 

J a J c 

HINT: Integrate over the subintervals (xi-i,Xi) , and use part (d). 

5.10: 

REMARK Note that there are parts of Theorem 5.5, p. 126 and Theorem 5.6, p. 127 that are 
not asserted in Theorem 5.17, p. 141. The point is that these other properties do not hold for 
improperly-integrable functions on open intervals. See the following exercise. 

Exercise 5.23 

a. Define / to be the function on [l,oo) given by f (x) = (— l) n_ jn if n — 1 < x < n. Show 
that / is improperly-integrable on (1, oo) , but that |/| is not improperly-integrable on (1, oo) . 
(Compare this with part (4) of Theorem 5.6, p. 127.) HINT: Verify that J t f is a partial 

sum of a convergent infinite series, and then verify that J 1 |/| is a partial sum of a divergent 
infinite series. 

b. Define the function / on (1, oo) by / (x) = 1/x. For each positive integer n, define the function 
/„ on (l,oo) by /„ (x) = 1/x if 1 < x < n and /„ (x) = otherwise. Show that each /„ is 
improperly-integrable on (1, oo) , that / is the uniform limit of the sequence {f n }, but that / 
is not improperly-integrable on (l,oo) . (Compare this with part (5) of Theorem 5.6.) 

c. Suppose / is a nonnegative real- valued function on the half-open interval (a, oo) that is in- 
tegrable on every closed bounded subinterval [a, 6] . For each positive integer n > a, define 
Un = / / (a?) dx. Prove that / is improperly-integrable on [a, oo) if and only if the sequence 
{y n } is convergent. In that case, show that J f = limy n . 

We are now able to prove an important result relating integrals over infinite intervals and convergence of 
infinite series. 

Theorem 5.18: 

Let / be a positive function on [1, oo) , assume that / is integrable on every closed bounded interval 
[1,6] , and suppose that / is nonincreasing; i.e., if x < y then / (x) > f (y) . For each positive integer 

i, set Oj = / (i) , and let Sn denote the iVth partial sum of the infinite series Yl a i '-Sn = Si=i a i- 

Then: 



1. For each N, we have 



2. For each N, we have that 



S N - oi < / / (a;) dx < SV-i- (5.84) 



r-N 

Sn-i — I f {x) dx < a\ — a,N < ai; (5.85) 



i.e., the sequence {SW^i — J ± /} is bounded above. 
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3. The sequence {Sn-i — /j /} is nondecreasing. 

4. (Integral Test) The infinite series E a i converges if and only if the function / is improperly- 
integrable on (1, oo) . 

Proof: 

For each positive integer TV, define a step function Um on the interval [1,N] as follows. Let 
P = {xq < x\ < ... < xn-i} be the partition of [1,N] given by the points {1 < 2 < 3 < ... < TV}, 
i.e., Xi = i + 1. Define kn (x) to be the constant Cj = / (i + 1) on the interval [xi-i,Xi) = [i, i + 1) . 
Complete the definition of Um by setting Um (TV) = f (N) . Then, because / is nonincreasing, we 
have that k^ (x) < f (x) for all x G [1, N] . Also, 

Jl k N = S,=l Ci(Xi-Xi-l) 

= E^/Ci + i) 

EL/W ( 5 - 86 ) 






which then implies that 



Sn — a-i = kjsi (x) dx < f (x) dx. (5.87) 

This proves half of part (1). 

For each positive integer N > 1 define another step function In, using the same partition P as 
above, by setting Zjy (x) = f (i) if i < x < i + 1 for 1 < i < N, and complete the definition of l^ 
by setting In (N) = / (TV) . Again, because / is nonincreasing, we have that / (x) < In (x) for all 
x e [1,TV]. Also 

If In = E^/W 

= E^ 1 o< ( 5 - 88 ) 

= Sn-i, 



which then implies that 



JV /-JV 

/ (a;) dx < In (x) dx = Sn-i, (5.89) 



and this proves the other half of part (1). 
It follows from part (1) that 

Sn-i — f (x) dx < Sn-i — Sn + a\ = ai — cln, (5.90) 

and this proves part (2). 

We see that the sequence {Sn-i — J 1 /} is nondecreasing by observing that 

„ r N+l j. „ f if, r N+l . 

ON ~ J 1 f - &N-1 + Ji J = a N-J N J 

= f(N)-J^ +1 f (5.91) 

> 0, 

because / is nonincreasing. 
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Finally, to prove part (4), note that both of the sequences {Sn} and {J 1 /} are nondecreasing. 

If / is improperly-integrable on [1, oo) , then lim^ J 1 f exists, and Sn < «i + /j / (%) dx for all 
N, which implies that J2 a i converges by Theorem 2.14, p. 48. Conversely, if J2 a i converges, then 
liraSN exists. Since J ± f (x) dx < Sn-i, it then follows, again from Theorem 2.14, p. 48, that 
lirriN /i / (x) dx exists. So, by the preceding exercise, / is improperly-integrable on [l,oo) . 

We may now resolve a question first raised in Exercise 2.32. That is, for 1 < s < 2, is the infinite series 
Y^ l/n s convergent or divergent? We saw in that exercise that this series is convergent if s is a rational 
number. 

Exercise 5.24 

a. Let s be a real number. Use the Integral Test to prove that the infinite series J2 \/n s is 
convergent if and only if s > 1. 

b. Let s be a complex number s = a + bi. Prove that the infinite series Yl l/n s is absolutely 
convergent if and only if a > 1 . 

Exercise 5.25 

Let / be the function on [l,oo) defined by / (x) = 1/x. 

a. Use Theorem 5.18, p. 142 to prove that the sequence {X^;=i ^ ~~ InN} converges to a positive 
number 7 < 1. (This number 7 is called Euler's constant.) HINT: Show that this sequence is 
bounded above and nondecreasing. 

b. Prove that 

V ' '- = ln2. (5.92) 

i=l 

HINT: Write S2N for the 27Vth partial sum of the series. Use the fact that 

2JV N 

^ = E-- 2 E^- ( 5 - 93 ) 

i=l i=l 



Now add and subtract ln(2N) and use part (a). 



5.8 Integration in the Plane 8 

Let S be a closed geometric set in the plane. If / is a real- valued function on S, we would like to define what 
it means for / to be "integrable" and then what the "integral" of / is. To do this, we will simply mimic our 
development for integration of functions on a closed interval [a, b] . 

So, what should be a "step function" in this context? That is, what should is a "partition" of S be in 
this context? Presumably a step function is going to be a function that is constant on the "elements" of a 
partition. Our idea is to replace the subintervals determined by a partition of the interval [a, b] by geometric 
subsets of the geometric set S. 

Definition 5.13: 

The overlap of two geometric sets S\ and S2 is defined to be the interior (S± (1 S2) of their 
intersection. Si and 6*2 are called nonoverlapping if this overlap (S\ PI S2) is the empty set. 



s This content is available online at <http://cnx.Org/content/m36223/l.2/>. 
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Definition 5.14: 

A partition of a closed geometric set S in R 2 is a finite collection {Si, S2, •••, S n } of nonoverlapping 
closed geometric sets for which U™_ 1 5'j = 5; i.e., the union of the Si's is all of the geometric set S. 

The open subsets {S®} are called the elements of the partition. 

A step function on the closed geometric set 5 is a real- valued function h on S for which there 
exists a partition P = {Si} of S such that h(z) = a,i for all z € Sf; i.e., h is constant on each 
element of the partition P. 

5.11: 

REMARK One example of a partition of a geometric set, though not at all the most general 
kind, is the following. Suppose the geometric set S is determined by the interval [a, b] and the two 
bounding functions u and I. Let {x n < xi < ... < x n } be a partition of the interval [a, b] . We make 
a partition {Si} of S by constructing vertical lines at the points Xj from I (xi) to u (x^ . Then Si 
is the geometric set determined by the interval [xi-i, Xi] and the two bounding functions Ui and k 
that are the restrictions of u and I to the interval [xi-i, Xi] . 

A step function is constant on the open geometric sets that form the elements of some partition. We say 
nothing about the values of h on the "boundaries" of these geometric sets. For a step function h on an interval 
[a, b] , we do not worry about the finitely many values of h at the endpoints of the subintervals. However, in 
the plane, we are ignoring the values on the boundaries, which are infinite sets. As a consequence, a step 
function on a geometric set may very well have an infinite range, and may not even be a bounded function, 
unlike the case for a step function on an interval. The idea is that the boundaries of geometric sets are 
"negligible" sets as far as area is concerned, so that the values of a function on these boundaries shouldn't 
affect the integral (average value) of the function. 

Before continuing our development of the integral of functions in the plane, we digress to present an 
analog of Theorem 3.21, p. 78 to functions that are continuous on a closed geometric set. 

Theorem 5.19: 

Let / be a continuous real-valued function whose domain is a closed geometric set S. Then there 
exists a sequence {h n } of step functions on S that converges uniformly to /. 
Proof: 

As in the proof of Theorem 3.21, p. 78, we use the fact that a continuous function on a compact 
set is uniformly continuous. 

For each positive integer n, let S n be a positive number satisfying \f (z) — f (w) | < 1/n if 
\z — w\ < S n . Such a S n exists by the uniform continuity of / on S. Because S is compact, it 
is bounded, and we let R = [a, b] x [c, d] be a closed rectangle that contains S. We construct a 
partition {5™} of S as follows. In a checkerboard fashion, we write R as the union U_R™ of small, 
closed rectangles satisfying 

1. If z and w are in Rf, then \z — w\ < S n . (The rectangles are that small.) 

2. Rf n Kf = 0. (The interiors of these small rectangles are disjoint.) 

Now define S? = S n Rf. Then Sf n Sf = 0, and S = US™. Hence, {S"f} is a partition of S. 

For each i, choose a point z™ in S™, and set o™ = / (2:™) . We define a step function h n as follows: 
If z belongs to one (and of course only one) of the open geometric sets 5™ , set h n (z) = a™. And, if 
z does not belong to any of the open geometric sets S™ , set h n (z) = f (z) . It follows immediately 
that h n is a step function. 

Now, we claim that \f (z) — h n (z) \ < 1/n for all z € S. For any z in one of the S™ 0, s, we have 

\f(z) - h n (z) \ = \f z - a?\ = \f(z) -f(z?) I < 1/n (5.94) 

because \z — zf\ < S n . And, for any z not in any of the S™ 0, s, f (z) — h n (z) = 0. So, we have 
defined a sequence {h n } of step functions on S, and the sequence {h n } converges uniformly to / by 
Exercise 3.29. 
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What follows now should be expected. We will define the integral of a step function h over a 
geometric set S by 

/n 
h=y j a i xA{S i ). (5.95) 

We will define a function / on S to be integrable if it is the uniform limit of a sequence {h n } of 
step functions, and we will then define the integral of / by 



/ = lim h n . (5.96) 

s J s 

Everything should work out nicely. Of course, we have to check the same two consistency questions 

we had for the definition of the integral on [a, b] , i.e., the analogs of Theorem 5.1, p. 119 and 

Theorem 5.4, p. 123. 

Theorem 5.20: 

Let S be a closed geometric set, and let ftbea step function on S. Suppose P = {Si, ..., S n } and 
Q = {Xi, ...,T m } are two partitions of S for which h(z) is the constant Oj on S 1 ? and h(z) is the 
constant bj on T^ . Then 

n m 

J2^A(S l ) = Y,b J A(T J ). (5.97) 

»=1 3=1 

Proof: 

We know by part (d) of Exercise 5.17 that the intersection of two geometric sets is itself a geometric 
set. Also, for each fixed index j, we know that the sets {Tj n Sf} are pairwise disjoint. Then, by 
Theorem 5.16, p. 139, we have that A (Tj) = 5^™=i -^ (^i ^ ^*) ■ Similarly, for each fixed i, we have 
that A (Si = J2T=i ^ {Tj fl Si) . Finally, for each pair i and j, for which the set T® n Sf is not 
empty, choose a point Zij G Tj n Sf, and note that Oj = h (zij) = bj, because Zij belongs to both 



f and If. 
With these observations, we then have that 



Er=iM(Si) = Er=iOiEr=i^( r i ns *) 

= ELiEJli^^n^) 

= E?=iEr=i fc («ij)^( r i ns i) 

= Er=iEr=iM(^nSi) (5.98) 

= E7 =1 ^Er=i^(^n^) 

which completes the proof. 

OK, the first consistency condition is satisfied. Moving right along: 

Definition 5.15: 

Let h be a step function on a closed geometric set S. Define the integral of h over the geometric 
set S by the formula 

f h= [ H(z) dz = ^r<XiA(Si) , (5.99) 

J s J s „_-, 
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where Si, ..., S n is a partition of S for which h is the constant a% on the interior Sf of the set Sj. 

Just as in the case of integration on an interval, before checking the second consistency result, we need 
to establish the following properties of the integral of step functions. 

Theorem 5.21: 

Let H (S) denote the vector space of all step functions on the closed geometric set S. Then the 
assignment h — » / h of H (S) into R has the following properties: 

1. (Linearity) H (S) is a vector space, and J „ (hi + h 2 ) = J s h\ + J s h 2 , and J g ch = cj „h for 
all hi, hi, h g H (S) , and for all real numbers c. 

2. If h = J^r=i c iXSi i s a linear combination of indicator functions of geometric sets that are 
subsets of S, then J h = Yl7=i c i A (^) ■ 

3. (Positivity) If h(z)>0 for all z e S, then J s h > 0. 

4. (Order-preserving) If hi and h 2 are step functions on S for which hi (z) < h 2 (z) for all z s S, 
then f s h± < J s h 2 - 

Proof: 

Suppose hi is constant on the elements of a partition P = {Si} and /i 2 is constant on the elements 
of a partition Q = {Tf\. Let V be the partition of the geometric set S whose elements are the sets 
{Uk} = {Sj° fl Tj}. Then both hi and h 2 are constant on the elements Uk of V, so that hi + h 2 is 
also constant on these elements. Therefore, hi + h 2 is a step function, and 

J (hi + h 2 ) = j2 (0* + ^) ^ (^) = Yl a ^ A ( u k) + J2 bkA (^t) = / ft i + / ft 2' ( 5 - 10 °) 

and this proves the first assertion of part (1). 

The proof of the other half of part (1), as well as parts (2), (3), and (4), are totally analogous 
to the proofs of the corresponding parts of Theorem 5.2, p. 121, and we omit the arguments here. 

Now for the other necessary consistency condition: 

Theorem 5.22: 

let S be a closed geometric set in the plane. 

1. If {h n } is a sequence of step functions that converges uniformly to a function / on S, then 
the sequence {J s h n } is a convergent sequence of real numbers. 

2. If {h n } and {k n } are two sequences of step functions on S that converge uniformly to the 
same function /, then 



lira \ h n = lira I k n . (5.101) 

J s J s 

Exercise 5.26 

Prove Theorem 5.22, p. 147. Mimic the proofs of Theorem 5.3, p. 123 and Theorem 5.4, p. 123. 

Definition 5.16: 

If / is a real-valued function on a closed geometric set S in the plane, then / is integmble on S if 
it is the uniform limit of a sequence {h n } of step functions on S. 
We define the integral of an integrable function / on S by 



/= / f(z) dz = lim h n , (5.102) 

s J s J s 

where {h n } is a sequence of step functions on S that converges uniformly to /. 

Theorem 5.23: 

Let S be a closed geometric set in the plane, and let I (S) denote the set of integrable functions 
on S. Then: 
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1. / (S) is a vector space of functions. 

2. If / and g s / (5) , and one of them is bounded, then fg s I (5) . 

3. Every step function is in I (S) . 

4. If / is a continuous real- valued function on S, then / is in I (S) . That is, every continuous 
real- valued function on S is integrable on S. 

Exercise 5.27 

a. Prove Theorem 5.23, p. 147. Note that this theorem is the analog of Theorem 5.5, p. 126, 
but that some things are missing. 

b. Show that integrable functions on S are not necessarily bounded; not even step functions have 
to be bounded. 

c. Show that, if / e I (S) , and g is a function on S for which / (x, y) = g (x, y) for all (a;, y) in 
the interior S° of S, then g € I (S) . That is, integrable functions on S can do whatever they 
like on the boundary. 

Theorem 5.24: 

Let S be a closed geometric set. The assignment / — » J f on / (S) satisfies the following properties. 

1. (Linearity) / (S) is a vector space, and f „ (af + j3g) = aj s f + (3f s g for all f,gel (S)and 
a, (3 e R. 

2. (Positivity) If f(z)>0 for all z e S, then j s f > 0. 

3. (Order-preserving) If /, g G / (S) and / (z) < g (z) for all z € S, then J s f < J s g. 

4. If / e / (S) , then so is |/|, and |/^/| < / g |/|. 

5. If / is the uniform limit of functions /„, each of which is in I (S) , then / e I (S) and 
J s f = limf s f n . 

6. Let {u n } be a sequence of functions in / (S) , and suppose that for each n there is a number 
m n , for which \u n (z) \ < m n for all z e S, and such that the infinite series J2m n converges. 
Then the infinite series J2 u n converges uniformly to an integrable function, and J s J2 u n = 

7. If / € / (S) , and {Si, ..., S n } is a partition of S, then f & I (Si) for all i, and 

n ~ 

= E/ /• ( 5 - 103 ) 

Exercise 5.28 

Prove Theorem 5.24, p. 148. It is mostly the analog to Theorem 5.6, p. 127. To see the last 

part, let hi be the step function that is identically 1 on <!?,; check that hif € I {Si) ; then examine 

EJsfhi- 

Of course, we could now extend the notion of integrability over a geometric set S to include complex- valued 

functions just as we did for integrability over an interval [a, b] . However, real- valued functions on geometric 

sets will suffice for the purposes of this book. 

We include here, to be used later in Section 7.1, a somewhat technical theorem about constructing 
partitions of a geometric set. 

Theorem 5.25: 

Let Si,...,S n be closed, nonoverlapping, geometric sets, all contained in a geometric set S. Then 

there exists a partition Si,---, Sm of S such that for 1 < i < n we have Si = Si- In other words, 
the Sj's are the first n elements of a partition of S. 
Proof: 

Suppose S is determined by the interval [a, b] and the two bounding functions u and I. We prove 
this theorem by induction on n. 
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If n = 1, let Si be determined by the interval [ai, 61] and the two bounding functions u\ and l\. 
Set Si = Si, and define four more geometric sets S2, ■■■, S5 as follows: 

1. 52 is determined by the interval [a, 01] and the two bounding functions u and I restricted to 
that interval. 

2. S3 is determined by the interval [ai,6i] and the two bounding functions u and u\ restricted 
to that interval. 

3. S4 is determined by the interval [01,61] and the two bounding functions I and li restricted 
to that interval. 

4. S*5 is determined by the interval [61, b] and the two bounding functions u and I restricted to 
that interval. 

Observe that the five sets Si, S2, ■ ■■, S5 constitute a partition of the geometric set S, proving the 
theorem in the case n = 1. 

Suppose next that the theorem is true for any collection of n sets satisfying the hypotheses. 
Then, given Si,..., S n +i as in the hypothesis of the theorem, apply the inductive hypothesis to the 
n sets Si, ...,S n to obtain a partition T 1; ..., T m of S for which T; = Si for all 1 < i < n. For each 
n + 1 < i < m, consider the geometric set S^ = S„+i fl Tj of the geometric set Tj. We may apply 
the case n = 1 of this theorem to this geometric set to conclude that S^ is the first element S^ x of 
a partition {S\ A , S' i2 , ...,S im .} of the geometric set Tj. 

Define a partition {Sk} of S as follows: For 1 < k < n, set Sk = Tk- Set Sn+i = ^"Ln+i^l 1 = 

S n +i. And define the rest of the partition {Sk} to be made up of the remaining sets S\ , for 

n+ 1 < i < m and 2 < j < mj. It follows directly that this partition {Sk} satisfies the requirements 
of the theorem. 

Exercise 5.29 

Let Si, ..., S n be as in the preceding theorem. Suppose Sk is determined by the interval [ak, bk] and 
the two bounding functions Uk and Ik ■ We will say that Sk is "below" Sj , equivalently Sj is "above" 
Sk, if there exists a point x such that Uk (x) < lj (x) . Note that this implies that x € [ak, &fe]n[oj, bj] . 

a. Suppose Sk is below Sj, and suppose (z,yk) € Sk and (z,yj) € Sj. Show that yj > yk- That 
is, if Sk is below Sj, then no part of Sk can be above Sj. 

b. Suppose S2 is below Si and S3 is below S2. Show that no part of S3 can be above Si. HINT: 
By way of contradiction, let xi € [«i,6i] be such that u 2 (xi) < li (xi) ; let x 2 € [02,^2] be 
such that u 3 (^2) < h (^2) ; and suppose a; 3 e [03,63] is such that Ui (x^) < l^ (x 3 ) . Derive 
contradictions for all possible arrangements of the three points xi,X2, and X3. 

c. Prove that there exists an index fco such that Sk is minimal in the sense that there is no other 
Sj that is below Sk„- HINT: Argue by induction on n. Thus, let {T/} be the collection of all 
S/c's that are below Si, and note that there are at most n— 1 elements of {T;}. By induction, 
there is one of the XJ's, i.e., an Sk that is minimal for that collection. Now, using part (b), 
show that this Sk must be minimal for the original collection. 

There is one more concept about integrating over geometric sets that we will need in later chapters. We have 
only considered sets that are bounded on the left and right by straight vertical lines and along the top and 
bottom by graphs of continuous functions y = u (x) and y = I (x) . We equally well could have discussed sets 
that are bounded above and below by straight horizontal lines and bounded on the left and right by graphs 
of continuous functions x = I (y) and x = r (y) . These additional sets do not provide anything particularly 



150 CHAPTER 5. INTEGRATION, AVERAGE BEHAVIOR 

important, so we do not discuss them. However, there are times when it is helpful to work with geometric 
sets with the roles of horizontal and vertical reversed. We accomplish this with the following definition. 

Definition 5.17: 

Let S be a subset of R 2 . By the symmetric image of S we mean the set S of all points (x, y) € R 2 
for which the point (y, x) s S. 

The symmetric image of a set is just the reflection of the set across the y = x line in the plane. Note that 
the symmetric image of the rectangle [a, b] x [c, d] is again a rectangle, [c, d] x [a, b] , and therefore the area 
of a rectangle is equal to the area of its symmetric image. This has the implication that if the symmetric 
image of a geometric set is also a geometric set, then they both have the same area. The symmetric image 
of a geometric set doesn't have to be a geometric set itself. For instance, consider the examples suggested 
in part (b) of . But clearly rectangles, triangles, and circles have this property, for their symmetric images 
are again rectangles, triangles, and circles. For a geometric set, whose symmetric image is again a geometric 
set, there are some additional computational properties of the area of S as well as the integral of functions 
over S, and we present them in the following exercises. 

Exercise 5.30 

Suppose S is a closed geometric set, which is determined by a closed interval [a,b] and two 

bounding functions u (x) and I (x) . Suppose the symmetric image S of S is also a closed geometric 



set, determined by an interval 



a, b 



and two bounding functions u (x) and / (x) 



a. Make up an example to show that the numbers a and b need not have anything to do with 

the numbers a and b, and that the functions u and / need not have anything to do with the 
functions u and I. 

b. Prove that S and S have the same area. HINT: use the fact that the area of a geometric set 
is approximately equal to the sum of the areas of certain rectangles, and then use the fact 
that the area of the symmetric image of a rectangle is the same as the area of the rectangle. 

c. Show that for every point (x, y) G S, we must have a< y <b, and for every such y, we must 

have I (y) < x <u (y) . HINT: If (x,y) E S, then (y,x) eS ■ 

d. (d) Prove that the area A (S) of S is given by the formula 

r-b l-u(x) r-b fU{y) 

A{S)= / 1 dydx = L L 1 dxdy. (5.104) 

J a Jl(x) J a J I (y) 

(See 5.9, p. 139.) 

e. Let S be the right triangle having vertices (o, c) , (b, c) , and (6, d) , where d > c. Describe the 

symmetric image of S; i.e., find the corresponding a, b,u, and I . Use part (d) to obtain the 
following formulas for the area of S : 



Exercise 5.31 



A(S)= I I 1 dsdt =11 1 dsdt. (5.105) 

b+^(a-b) 



a. Prove that if S\ and S2 are geometric sets whose symmetric images are again geometric sets, 
then the symmetric image of the geometric set Si PI 6*2 is also a geometric set. 
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b. Suppose T is a closed geometric set that is contained in a closed geometric set S. Assume 
that both the symmetric images T and S are also geometric sets. If S is determined by an 
interval a, 5 and two bounding functions u and / , prove that 



rb ru(s) 
A (T) = L L xt (t, s) dtds, (5.106) 

Ja Jl(s) 

where \t is the indicator function of the set T; i.e., \t (t, s) = 1 if (t, s) s T, and \t (t, s) = 
if (t, s) $_ T. HINT: See the proof of Theorem 5.16, p. 139, give names to all the intervals and 
bounding functions, and in the end use part (d) of the preceding exercise. 
c. Suppose {Si} is a partition of a geometric set S, and suppose the symmetric images of S and 
all the Si's are also geometric sets. Suppose ft is a step function that is the constant Oj on 
the element Sf of the partition {Si}. Prove that J „h = X^™=i a »/s^S?i an d therefore that 





,6 


Mi) 


,-h 


,-u(s) 


h = 


/ 


/ h (t, s) dsdt = 


h 


/- h (t, s) dtds 


s 


J a " 


h(t) 


J a 


'l(s) 



(5.107) 



HINT: Use part (b). 



d. Let S be a geometric set whose symmetric image S is also a geometric set, and suppose / is 
a continuous function on S. Show that 



r f b /•«(*) rb t-u{s) 

If = I / f(t,s) dsdt = L L f (t, s) dtds. (5.108) 

J S Ja J Mi) J a J l (s) 



HINT: Make use of the fact that the step functions constructed in Theorem 5.19, p. 145 
satisfy the assumptions of part (c). Then take limits. 
e. Let S be the triangle in part (e) of the preceding exercise. If / is a continuous function on S, 
show that the integral of / over S is given by the formulas 

rb fd+i^ b {c-d) f d , 

/=/ / f (t, s) dsdt = / f{t,s)dtds. (5.109) 

S Ja Jc Jc J b+J=|j(o-6) 
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Chapter 6 

Integration over Smooth Curves in the 
Plane 

6.1 Integration Over Smooth Curves in the Plane C=2ir r 1 

In this chapter we will define what we mean by a smooth curve in the plane and what is meant by its arc 
length. These definitions are a good bit more tricky than one might imagine. Indeed, it is the subtlety of the 
definition of arc length that prevented us from defining the trigonometric functions in terms of wrapping the 
real line around the circle, a definition frequently used in high school trigonometry courses. Having made a 
proper definition of arc length, we will then be able to establish the formula C = 2irr for the circumference 
of a circle of radius r. 

By the "plane," we will mean R 2 = C, and we will on occasion want to carefully distinguish between 
these two notions of the plane, i.e., two real variables x and y as opposed to one complex variable z = x + iy. 
In various instances, for clarity, we will use notations like x + iy and (x, y) , remembering that both of these 
represent the same point in the plane. As x + iy, it is a single complex number, while as (x, y) we may think 
of it as a vector in R 2 having a magnitude and, if nonzero, a direction. 

We also will define in this chapter three different kinds of integrals over such curves. The first kind, called 
"integration with respect to arc length," will be completely analogous to the integral defined in for functions 
on a closed and bounded interval, and it will only deal with functions whose domain is the set consisting 
of the points on the curve. The second kind of integral, called a "contour integral," is similar to the first 
one, but it emphasizes in a critical way that we are integrating a complex-valued function over a curve in 
the complex plane C and not simply over a subset of R 2 . The applications of contour integrals is usually to 
functions whose domains are open subsets of the plane that contain the curve as a proper subset, i.e., whose 
domains are larger than just the curve. The third kind of integral over a curve, called a "line integral," is 
conceptually very different from the first two. In fact, we won't be integrating functions at all but rather a 
new notion that we call "differential forms." This is actually the beginnings of the subject called differential 
geometry, whose intricacies and power are much more evident in higher dimensions than 2. 

The main points of this chapter include: 

1. The definition of a smooth curve, and the definition of its arc length, 

2. the derivation of the formula C = 2irr for the circumference of a circle of radius r (Theorem 6.5, p. 
163), 

3. the definition of the integral with respect to arc length, 

4. the definition of a contour integral, 

5. the definition of a line integral, and 

6. Green's Theorem (Theorem 6.15, Green, p. 177). 



1 This content is available online at <http://cnx.Org/content/m36224/l.2/>. 
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6.2 Smooth Curves in the Plane 2 

Our first project is to make a satisfactory definition of a smooth curve in the plane, for there is a good bit 
of subtlety to such a definition. In fact, the material in this chapter is all surprisingly tricky, and the proofs 
are good solid analytical arguments, with lots of e's and references to earlier theorems. 

Whatever definition we adopt for a curve, we certainly want straight lines, circles, and other natural 
geometric objects to be covered by our definition. Our intuition is that a curve in the plane should be a 
"1-dimensional" subset, whatever that may mean. At this point, we have no definition of the dimension of 
a general set, so this is probably not the way to think about curves. On the other hand, from the point 
of view of a physicist, we might well define a curve as the trajectory followed by a particle moving in the 
plane, whatever that may be. As it happens, we do have some notion of how to describe mathematically the 
trajectory of a moving particle. We suppose that a particle moving in the plane proceeds in a continuous 
manner relative to time. That is, the position of the particle at time t is given by a continuous function 
f (t) = x (t) + iy (t) = (x (t) ,y (£)) , as t ranges from time a to time b. A good first guess at a definition of 
a curve joining two points z\ and zi might well be that it is the range C of a continuous function / that is 
defined on some closed bounded interval [a, b] . This would be a curve that joins the two points z\ = f (a) 
and zi = f (b) in the plane. Unfortunately, this is also not a satisfactory definition of a curve, because of 
the following surprising and bizarre mathematical example, first discovered by Guiseppe Peano in 1890. 

6.1: 

THE PEANO CURVE The so-called "Peano curve" is a continuous function / defined on the 
interval [0, 1] , whose range is the entire unit square [0, 1] x [0, 1] in R 2 . 

Be careful to realize that we're talking about the "range" of / and not its graph. The graph of a real- 
valued function could never be the entire square. This Peano function is a complex- valued function of a real 
variable. Anyway, whatever definition we settle on for a curve, we do not want the entire unit square to be 
a curve, so this first attempt at a definition is obviously not going to work. 

Let's go back to the particle tracing out a trajectory. The physicist would probably agree that the particle 
should have a continuously varying velocity at all times, or at nearly all times, i.e., the function / should 
be continuously differentiable. Recall that the velocity of the particle is defined to be the rate of change of 
the position of the particle, and that's just the derivative /' of /. We might also assume that the particle 
is never at rest as it traces out the curve, i.e., the derivative / (t) is never 0. As a final simplification, we 
could suppose that the curve never crosses itself, i.e., the particle is never at the same position more than 
once during the time interval from t = a to t = b. In fact, these considerations inspire the formal definition 
of a curve that we will adopt below. 

Recall that a function / that is continuous on a closed interval [a, b] and continuously differentiable on the 
open interval (a, 6) is called a smooth function on [a,b] . And, if there exists a partition {to < t\ < ... < t n } 
of [a, b] such that / is smooth on each subinterval [tj_i,tj], then / is called piecewise smooth on [a, b] . 
Although the derivative of a smooth function is only defined and continuous on the open interval (a, 6) , and 
hence possibly is unbounded, it follows from part (d) of Exercise 5.22 that this derivative is improperly- 
integrable on that open interval. We recall also that just because a function is improperly-integrable on an 
open interval, its absolute value may not be improperly-integrable. Before giving the formal definition of 
a smooth curve, which apparently will be related to smooth or piecewise smooth functions, it is prudent 
to present an approximation theorem about smooth functions. Exercise 3.20 asserts that every continuous 
function on a closed bounded interval is the uniform limit of a sequence of step functions. We give next a 
similar, but stronger, result about smooth functions. It asserts that a smooth function can be approximated 
"almost uniformly" by piecewise linear functions. 

Theorem 6.1: 

Let / be a smooth function on a closed and bounded interval [a, b] , and assume that |/'| is 
improperly-integrable on the open interval (a, b) . Given an e > 0, there exists a piecewise linear 
function p for which 



2 This content is available online at <http://cnx.Org/content/m36225/l.2/>. 



155 

1. |/ (x) — p (x) | < e for all x € [a, b] . 

2. J b a \f(x)-p(x)\dx<e. 

That is, the functions / and p are close everywhere, and their derivatives are close on average in 
the sense that the integral of the absolute value of the difference of the derivatives is small. 
Proof: 

Because / is continuous on the compact set [a, b] , it is uniformly continuous. Hence, let 5 > be 
such that if x, y € [a, b] , and \x — y\ < S, then \f (x) — f (y)\ < e/2. 

Because |/'| is improperly-integrable on the open interval (a, b) , we may use part (b) of Exer- 
cise 5.22 to find a 5' > 0, which may also be chosen to be < 5, such that / |/'| + /{,_$• |/'| < e/2, 
and we fix such a 5' . 

Now, because /' is uniformly continuous on the compact set [a+ d' ,b — S'] , there exists an 
a > such that \f (x) — f (y) \ < e/4 (b — a) if x and y belong to [a + S' ,b — S'] and \x — y\ < a. 
Choose a partition {xq < x\ < ... < x n } of [a, b] such that xq = a, x\ = a + S , x n -i = b— 5 ,x n = b, 
and Xi — Xi-i < min(5,a) for 2 < i < n — 1. Define p to be the piecewise linear function on 
[a, b] whose graph is the polygonal line joining the n + 1 points (a, f (x\)) ,{(#,, / (#i))} for 1 < 
i < n — 1, and (6, /(x n _i)) . That is, p is constant on the outer subintervals [a,a;i] and [x n _i,6] 
determined by the partition, and its graph between x\ and x n -\ is the polygonal line joining the 
points {(xi, f (xi)) , ...,(x n ^i, f (x n -i))}. For example, for 2 < i < n — 1, the function p has the 
form 

p(x) = f fo-i) + ; {Xi) ~ f{Xi - l) (x - au-0 (6.1) 

on the interval [#i_i, Xj] . So, p (x) lies between the numbers / (xi-i) and / (xi) for all i. Therefore, 

|/ (*) - p (x) I < |/ (a:) - / (x 4 ) | + |/ ( Xi ) -l(x)\<\f (x) - f (x t ) \ + \f (x t ) - f ( Xi -i) \ < e. (6.2) 

Since this inequality holds for all i, part (1) is proved. 

Next, for 2 < i < n — 1, and for each x G (xi-i,Xi) , we have p (x) = 
(/ (^i) — / ( x i-i)) I i x i ~ x i-i) j which, by the Mean Value Theorem, is equal to /' (yi) for some 
y t e (xi-i,Xi) . So, for each such x e (xi-i,Xi) , we have \f (x) - p (x) \ = \f (x) - f (y t ) |, and 
this is less than e/4 (b — a) , because \x — yi\ < a. On the two outer intervals, p (x) is a constant, 
so that p (x) = 0. Hence, 

Jllf'-p'l = YZJZJf'-Pl 

= ri/'i+Er= 2 1 i/'-p'i+/i_ 1 i/i 
< r^'i/'i+j^i/'i+i^rri 

< £. 



(6.3) 



The proof is now complete. 

6.2: 
REMARK It should be evident that the preceding theorem can easily be generalized to a piecewise 
smooth function /, i.e., a function that is continuous on [a, b] , continuously differentiable on each 
subinterval (ij_i,ij) of a partition {t < t\ < ... < t n }, and whose derivative /' is absolutely 
integrable on (a, b) . Indeed, just apply the theorem to each of the subintervals (t»_i, U) , and then 
carefully piece together the piecewise linear functions on those subintervals. 

Now we are ready to define what a smooth curve is. 
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Definition 6.1: 

By a smooth curve from a point z\ to a different point z 2 in the plane, we mean a set C C C that 
is the range of a 1-1, smooth, function : [a, b] — » C, where [a, 6] is a bounded closed interval in R, 
where z\ = (a) and Z2 = (6) , and satisfying </>' (t) ^ for all t G (a, 6) . 

More generally, if '■ [a, b] — > -R 2 is 1-1 and piecewise smooth on [a, b] , and if {io < t\ < ... < t n } 
is a partition of [a,b] such that </>' (t) / for all £ s (t»-i,t») , then the range C of is called a 
piecewise smooth curve from zi = (a) to z 2 = <f) (b) . 

In either of these cases, is called a parameterization of the curve C. 

Note that we do not assume that \<f\ is improperly-integrable, though the preceding theorem might have 
made you think we would. 

6.3: 

REMARK Throughout this chapter we will be continually faced with the fact that a given curve 
can have many different parameterizations. Indeed, if <f>i : [a, b] — > C is a parameterization, and 
if g : [c, d] — » [a, 6] is a smooth function having a nonzero derivative, then 2 (s) = 01 (<? («)) 
is another parameterization of C. Since our definitions and proofs about curves often involve a 
parametrization, we will frequently need to prove that the results we obtain are independent of the 
parameterization. The next theorem will help; it shows that any two parameterizations of C are 
connected exactly as above, i.e., there always is such a function g relating 0i and 02- 

Theorem 6.2: 

Let <fii : [a, b] — > C and 02 : [c, d] — » C be two parameterizations of a piecewise smooth curve 
C joining Z\ to z 2 . Then there exists a piecewise smooth function g : [c,d] — > [a, 6] such that 
<^2 (s) = 01 (<? (s)) for all s s [c, d] . Moreover, the derivative g of g is nonzero for all but a finite 
number of points in [c, d] . 
Proof: 

Because both 0i and 02 are continuous and 1-1, it follows from Theorem 3.10, p. 66 that the 
function g = 0^ 1 o 2 is continuous and 1-1 from [c, d] onto [a, b] . Moreover, from Theorem 3.11, p. 
67, it must also be that g is strictly increasing or strictly decreasing. Write 0! (t) = U\ (t) + ii>i (t) = 
(«i (t) ,vi (t)) , and 02 (s) = m 2 (s) + iw 2 (s) = («2 (s) , V2 (s)) • Let {a; < xi < ... < a; p } be a 
partition of [a, b] for which 0j is continuous and nonzero on the subintervals (xj-i,Xj) , and let 
{yo < Vi < ■■■ < Vq} be a partition of [c, d] for which 2 is continuous and nonzero on the subintervals 
{yk-iiVk) ■ Then let {so < s\ < ... < s n } be the partition of [c,d] determined by the finitely many 
points {j/fc} U {g^ 1 (xj)}. We will show that g is continuously different iable at each point s in the 
subintervals (sj_i,sj). 

Fix an s in one of the intervals (s,-i, Si) , and let t = 0] -1 (02 (s)) = g (s) . Of course this means 
that 0i (t) = 02 (s) , or u\ (t) = u 2 (s) and ^i (t) = v 2 (s) . Then £ is in some one of the intervals 
(xj-i,Xj) , so that we know that <p\ (t) ^ 0. Therefore, we must have that at least one of u\ (t) 
or v\ (t) is nonzero. Suppose it is v\ (t) that is nonzero. The argument, in case it is u\ (t) that 
is nonzero, is completely analogous. Now, because v\ is continuous at t and v\ (t) ^ 0, it follows 
that vi is strictly monotonic in some neighborhood (t — 6,t + 6) of t and therefore is 1-1 on that 
interval. Then v^ 1 is continuous by Theorem 3.10, p. 66, and is differentiable at the point v\ (t) 
by the Inverse Function Theorem. We will show that on this small interval g = v^ 1 o v 2 , and this 
will prove that g is continuously differentiable at s. 

Note first that if 2 (a) = x + iy is a point on the curve C, then v 2 ((f)^ 1 (x + iyj) = y. Then, 
for any r G [a, b] , we have 

vr 1 {v 2 {g- 1 (r))) = ^M^^OM^)))) 

v^ 1 ( Vl (r)) 



(6.4) 
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showing that u-f 1 o v 2 = g^ 1 = g. Hence g is continuously differentiable at every point s in the 

subintervals (s,_i, Sj) . Indeed g (a) = v^ 1 (v 2 (c)) v 2 (c) for all a near s, and hence g is piecewise 
smooth. 

Obviously, cf> 2 (s) = 0i {g (s)) for all s, implying that 4> 2 (s) = (j)\ (g {s}) g' (s) . Since <fi 2 (s) / 
for all but a finite number of points s, it follows that g (s) 7^ for all but a finite number of points, 
and the theorem is proved. 

Corollary 6.1: 

Let <pi and <fi 2 be as in the theorem. Then, for all but a finite number of points z = (j>\ (t) = 4> 2 (s) 
on the curve C, we have 

01 (*) _ 02 («) , Q 5 n 



I0i(*)l 102(5)1 
Proof: 
From the theorem we have that 

02 (*) = 0i (9 (a))g (*) = 0'i (t) g (s) (6.6) 

for all but a finite number of points s e (c, d) . Also, g is strictly increasing, so that g (s) > 
for all points s where g is differentiable. And in fact, g (s) 7^ for all but a finite number of s's, 
because g (s) is either (v^ 1 o v 2 ) (s) or (u^f 1 o u 2 ) (s) , and these are nonzero except for a finite 
number of points. Now the corollary follows by direct substitution. 

6.4: 

REMARK If we think of </>' (t) = (x (t) ,y (t)) as a vector in the plane R 2 , then the corollary 
asserts that the direction of this vector is independent of the parameterization, at least at all but a 
finite number of points. This direction vector will come up again as the unit tangent of the curve. 

The adjective "smooth" is meant to suggest that the curve is bending in some reasonable way, and 
specifically it should mean that the curve has a tangent, or tangential direction, at each point. We give the 
definition of tangential direction below, but we note that in the context of a moving particle, the tangential 
direction is that direction in which the particle would continue to move if the force that is keeping it on 
the curve were totally removed. If the derivative <j> (t) / 0, then this vector is the velocity vector, and its 
direction is exactly what we should mean by the tangential direction. 

The adjective "piecewise" will allow us to consider curves that have a finite number of points where there 
is no tangential direction, e.g., where there are "corners." 

We are carefully orienting our curves at the moment. A curve C from Z\ to z 2 is being distinguished from 
the same curve from z 2 to Z\, even though the set C is the same in both instances. Which way we traverse 
a curve will be of great importance at the end of this chapter, when we come to Green's Theorem. 

Definition 6.2: 

Let C, the range of <j> : [a,b] — > C, be a piecewise smooth curve, and let z = (x,y) = 0(c) be 
a point on the curve. We say that the curve C has a tangential direction at z, relative to the 
parameterization <f>, if the following limit exists: 

*™i0(i)-*i *™i<^)-0( C )r ( } 

If this limit exists, it is a vector of length 1 in R 2 , and this unit vector is called the unit tangent 
(relative to the parameterization <f>) to C at z. 

The curve C has a unit tangent at the point z if there exists a parameterization <j> for which the 
unit tangent at z relative to </> exists. 

Exercise 6.1 
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a. Restate the definition of tangential direction and unit tangent using the R 2 version of the 
plane instead of the C version. That is, restate the definition in terms of pairs (x,y) of real 
numbers instead of a complex number z. 

b. Suppose (j> : [a, b] — > C is a parameterization of a piecewise smooth curve C, and that t € (a, b) 
is a point where <fi is differentiable with <f> (t) / 0. Show that the unit tangent (relative to 
the parameterization <f>) to C at z = <p(t) exists and equals <fi' (t) /\<jj (t) |. Conclude that, 
except possibly for a finite number of points, the unit tangent to C at z is independent of the 
parameterization. 

c. Let C be the graph of the function / (£) = |t| for t G [—1,1] . Is C & smooth curve? Is it a 
piecewise smooth curve? Does C have a unit tangent at every point? 

d. Let C be the graph of the function / (t) = t 2 l z = {t 1 ^) for t e [— 1, 1] . Is C a smooth curve? 
Is it a piecewise smooth curve? Does C have a unit tangent at every point? 

e. Consider the set C that is the right half of the unit circle in the plane. Let <px : [— 1, 1] — > C 
be defined by 

<Pi(t)= (cos (*!), sin (*!)), (6 - 8) 

and let <fi 2 '■ [— 1, 1] — » C be defined by 

<h(t) = ( c ° s 3 f)' sm ( t3 f))- (6 - 9) 

Prove that <f>i and <f> 2 are both parameterizations of C. Discuss the existence of a unit tangent 
at the point (1,0) = 4>\ (0) = 4>2 (0) relative to these two parameterizations. 

f. Suppose <j) : [a, b] — > C is a parameterization of a curve C from z\ to Z2- Define ip on [a, 6] by 
ip (t) = cf) (a + b — t) . Show that ip is a parameterization of a curve from z 2 to z\. 

Exercise 6.2 

a. Suppose / is a smooth, real-valued function defined on the closed interval [a, b] , and let C C R 2 
be the graph of /. Show that C is a smooth curve, and find a "natural" parameterization 
(j> : [a, b] — * C of C. What is the unit tangent to C at the point (t, f (£))? 

b. Let z\ and z 2 be two distinct points in C, and define </> : [0, 1] — * c by </> (i) = (1 — t) z\ + tz 2 - 
Show that is a parameterization of the straight line from the point z\ to the point z 2 . 
Consequently, a straight line is a smooth curve. (Indeed, what is the definition of a straight 
line?) 

c. Define a function (p '■ [— r , r ] —* R 2 by <f> (t) = (t, \Jr 2 — t 2 ) . Show that the range C of <f> is a 
smooth curve, and that is a parameterization of C. 

d. Define <fi on [0, 7r/2) by <f> (t) = e lt . For what curve is <f> a parametrization? 

e. Let zi,Z2,---,z n be n distinct points in the plane, and suppose that the polygonal line joing 
these points in order never crosses itself. Construct a parameterization of that polygonal line. 

f. Let S be a piecewise smooth geometric set determined by the interval [a, 6] and the two 
piecewise smooth bounding functions u and /. Suppose z\ and z 2 are two points in the interior 
S° of S. Show that there exists a piecewise smooth curve C joining z\ to z 2 , i.e., a piecewise 



smooth function 



a, b 




C with 0(a) = Z\ and <f> b ) = z 2 , that lies entirely in 5°. 



g. Let C be a piecewise smooth curve, and suppose 4> : [a, b] — > C is a parameterization of C. 
Let [c, d] be a subinterval of [a, b] . Show that the range of the restriction of </> to [c, d] is a 
smooth curve. 

Exercise 6.3 

Suppose C is a smooth curve, parameterized by <f> = u + iv : [a, b] — ► C. 
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a. Suppose that u (t) ^ for all t G (a, b) . Prove that there exists a smooth, real- valued function 
/ on some closed interval [a , b'] such that C coincides with the graph of /. HINT: / should 
be something like v o u~ l . 

b. What if v (t) ^ for all t e (a, b)l 

Exercise 6.4 

Let C be the curve that is the range of the function <f> : \— 1, 1] — > C, where <j> (t) = t 3 + t 6 i. 

a. Is C a piecewise smooth curve? Is it a smooth curve? What points z\ and z% does it join? 

b. Is (j> a parameterization of CI 

c. Find a parameterization for C by a function ip : [3, 4] — > C. 

d. Find the unit tangent to C and the point + Oi. 

Exercise 6.5 

Let C be the curve parameterized by <fi : [—tt, it — e] — > C defined by <fi (t) = e lt = cos (t) + isin (t) . 

a. What curve does </> parameterize? 

b. Find another parameterization of this curve, but base on the interval [0, 1 — e] . 



6.3 Arc Length 3 

Suppose C is a piecewise smooth curve, parameterized by a function <f>. Continuing to think like a physicist, 
we might guess that the length of this curve could be computed as follows. The particle is moving with 
velocity <j> (t) . This velocity is thought of as a vector in R 2 , and as such it has a direction and a magnitude 
or speed. The speed is just the absolute value \<j> (t) | of the velocity vector <j> (t) . Now distance is speed 
multiplied by time, and so a good guess for the formula for the length L of the curve C would be 



L= \<j> (t)\dt. (6.10) 

•J a 

Two questions immediately present themselves. First, and of primary interest, is whether the function |</>'| 
is improperly-integrable on (a, 6)? We know by Exercise 5.22 that <f> itself is improperly-integrable, but we 
also know from Exercise 5.23 that a function can be improperly-integrable on an open interval and yet its 
absolute value is not. In fact, the answer to this first question is no (See Exercise 6.6 (A curve of infinite 
length).). We know only that \<f\ exists and is continuous on the open subintervals of a partition of [a, b] . 

The second question is more subtle. What if we parameterize a curve in two different ways, i.e., with two 
different functions <f>\ and 02? How do we know that the two integral formulas for the length have to agree? 
Of course, maybe most important of all to us, we also must justify the physicist's intuition. That is, we 
must give a rigorous mathematical definition of the length of a smooth curve and show that Formula ((6.10)) 
above does in fact give the length of the curve. First we deal with the independence of parameterization 
question. 

Theorem 6.3: 

Let C be a smooth curve joining (distinct) points z\ to Z2 in C, and let <fii : [a, b] — > C and 
<f>2 ■ [c, d] — > C be two parameterizations of C. Suppose \<fi' 2 \ is improperly-integrable on (c, d) . Then 
|</>i|is improperly-integrable on (a, b) , and 



/ 



NiWII dt= I \\<>:,(s)\\ ds. (6.ii) 



3 This content is available online at <http://cnx.Org/content/m36226/l.2/>. 
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Proof: 

We will use Theorem 6.2, p. 156. Thus, let g = cj)^ 1 o <f> 2 , and recall that g is continuous on [c,d] 
and continuously differentiable on each open subinterval of a certain partition of [c, d] . Therefore, 
by part (d) of Exercise 5.22, g is improperly-integrable on (c, d) . 

Let {xo < x\ < ... < x p } be a partition of [a, b] for which <p\ is continuous and nonzero on the 
subintervals (xj-\,Xj) . To show that |^| is improperly-integrable on (a,b) , it will suffice to show 
this integrability on each subinterval (xj-i, Xj) . Thus, fix a closed interval [a , &'] C (xj-i, Xj) , and 
let [c',d] be the closed subinterval of [c,d] such that g maps [c',d'] 1-1 and onto [a , b'] . Hence, 
by part (e) of Exercise 5.22, we have 

jjVi Wl* = tf\4>\(g(s))\g(s)ds 

= ^W 1 (g(s))\\ 9 , ()s\ds 

= I^\^(9(s))9'(s)\da 

= J*'\(<j>iog)'(s)\ds 

// \4>2 (*) I da 

< J? 1^2 (*) I ds, 

which, by taking limits as a goes to Xj-\ and b' goes to Xj, shows that \<p\\ is improperly-integrable 
over (xj-\,Xj) for every j, and hence integrable over all of (a,b) . Using part (e) of Exercise 5.22 
again, and a calculation similar to the one above, we deduce the equality 

d 

l^sl, (6-13) 

J a J c 

and the theorem is proved. 

Exercise 6.6: A curve of infinite length 

Let <j> : [0, 1] : R 2 be defined by (j) (0) = (0, 0) , and for t > 0,0 (t) = (t, tsin (1/t)) . Let C be the 
smooth curve that is the range of <j>. 

a. Graph this curve. 

b. Show that 



\4>'{t)\ = yi + W(iA)-^Z!) + £^va 

= \Jt 2 + t 2 sin 2 (1/t) - tsin (2/t) + cos 2 (1/t). 



c. Show that 



1 10' w i * = r w? + ^ - ^ + c ° s2 {t) m - (6 - i5) 



d. Show that there exists an e > so that for each positive integer n we have cos 2 (t) — 
sin (2t) /t > 1/2 for all t such that \t — mr\ < e. 

e. Conclude that \<f>'\ is not improperly-integrable on (0, 1) . Deduce that, if Formula ((6.10)) is 
correct for the length of a curve, then this curve has infinite length. 

Next we develop a definition of the length of a parameterized curve from a purely mathematical or geometric 
point of view. Happily, it will turn out to coincide with the physically intuitive definition discussed above. 
Let C be a piecewise smooth curve joining the points z\ and z 2 , and let cj> : [a, b] — > C be a parameterization 
of C. Let P = {a = to < ti < ... < t n = b} be a partition of the interval [a, b] . For each < j < n write 
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Zj = 4>{tj) , and think about the polygonal trajectory joining these points {zj} in order. The length L P of 
this polygonal trajectory is given by the formula 



4 = I>i-*i-i|. ( 6 - 16 ) 

and this length is evidently an approximation to the length of the curve C. Indeed, since the straight line 
joining two points is the shortest curve joining those points, these polygonal trajectories all should have a 
length smaller than or equal to the length of the curve. These remarks motivate the following definition. 

Definition 6.3: 

Let <f> : [a, b] — » C be a parameterization of a piecewise smooth curve C C C. By the lengthL^ of 
C, relative to the parameterization <j>, we mean the number L^ = sup P L P , where the supremum is 
taken over all partitions P of [a, b] . 

6.5: 

Of course, the supremum in the definition above could well equal infinity in some cases. Though it 
is possible for a curve to have an infinite length, the ones we will study here will have finite lengths. 
This is another subtlety of this subject. After all, every smooth curve is a compact subset of R 2 , 
since it is the continuous image of a closed and bounded interval, and we think of compact sets as 
being "finite" in various ways. However, this finiteness does not necessarily extend to the length of 
a curve. 

Exercise 6.7 

Let <f> : [a, b] — » R 2 be a parameterization of a piecewise smooth curve C, and let P and Q be two 
partitions of [a, b] . 

a. If P is finer than Q, i.e., QCP, show that Lq < L P . 

b. If <j>(t) = u (t) + iv (t) , express L P in terms of the numbers u (tj) and v (tj) . 

Of course, we again face the annoying possibility that the definition of length of a curve will depend on the 
parameterization we are using. However, the next theorem, taken together with Theorem 6.3, p. 159, will 
show that this is not the case. 

Theorem 6.4: 

If C is a piecewise smooth curve parameterized by <fi : [a, b] — ► C, then 

b 

\<f>'(t)\dt, (6.17) 

specifically meaning that one of these quantities is infinite if and only if the other one is infinite. 
Proof: 

We prove this theorem for the case when C is a smooth curve, leaving the general argument for 
a piecewise smooth curve to the exercises. We also only treat here the case when L^ is finite, also 
leaving the argument for the infinite case to the exercises. Hence, assume that <f> = u + iv is a 
smooth function on [a, b] and that L^ < oo. 

Let e > be given. Choose a partition P = {to < t\ < ... < t n } of [a, b] for which 

n 

L* - L% = L* - J2 10 (tj) - 4> (tj-!) | < e. (6.18) 

Because <fi is continuous, we may assume by making a finer partition if necessary that the tj's are 
such that \<f> (ii) — <fi (to) | < e and \<j> (t n ) — <p (£ n _i) | < e. This means that 

n-l 

L'0-^2\4>(t j )-4>(t j -i)\<3e. (6.19) 

i=2 
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The point of this step (trick) is that we know that </>' is continuous on the open interval (a, b) , but 
we will use that it is uniformly continuous on the compact set [t\, t n -i] ■ Of course that means that 
\<j>'\ is integrable on that closed interval, and in fact one of the things we need to prove is that \<f\ 
is improperly-integrable on the open interval (a, b) . 

Now, because (j> is uniformly continuous on the closed interval [ti,t n _i] , there exists a 5 > 
such that \<f> (t) — </>' (s) | < £ if \t — s\ < 6 and t and s are in the interval [t\, t n _i] . We may assume, 
again by taking a finer partition if necessary, that the mesh size of Pis less than this 6. Then, using 
part (f) of Exercise 5.9, we may also assume that the partition P is such that 

*n-l ™ _1 

|0' (t) \dt-J2 \4>' i'j) I (tj - tj-i) I < e (6.20) 

1 i=2 

no matter what points Sj in the interval (tj-\, tj) are chosen. So, we have the following calculation, 
in the middle of which we use the Mean Value Theorem on the two functions u and v. 

\L*-^\4>'{t)\dt\ 

\L+-i:]-Z\<Hti)-<i>(t J -i)\ 

-\Yr j Zl\4>{tj)-4>{t i -i)\-S t t ;- 1 W{t)\dt\ 

\u (tj) - u(t J --i)+t(t;fa)-t;(t 3 --i)|-J t *- 1 |0' (*) I dt\ 

E"=a 0^) -M^-i)) 2 + («(*,) -«tVi)) 2 

3e + | E^ 1 ^/( U ' ( Sj )f + (v (rj)) 2 (tj - t,-_i) 

< 3 £ + 1 e;=2 Vk(^)) 2 + («'(^)) 2 (*i - tj-o 

- j/r 1 1^' w i <fti (6.2i) 

+ YTjZl \^(u( S] )) 2 + (v(r 3 )f- ^(u( Sj )f + (v'( Sj )f\ (tj - ^J 

3 £ + ie^2 1 i^'(^)I(*j- *j- i) - sir 1 w w i rf *i 

+ E "=2 lV(«(^)) 2 + («'(^)) 2 - V / («(^)) 2 + («'(^)) 2 | (*j " tj-i) 



< 




< 




< 


3e 


= 


Se+IE^I 


— 


3e + 



< 



4 £ + V™- 1 I(^>j)) -("'(^)) I (, _ . X 

Z " J=2 V(«'(^)) 2 +(«'(^)) 2 +V(«'(^)) 2 +(«'(^)) 2 v J 



<- 1- I V^"- 1 I 11 ( r j)-" ( s i)\\ v ( r i)+ v ( s ])\ If _+ \ 

< 4e + E^2 \V (Tj) ~ V (Sj) | (tj ~ tj-!) 

< 4e + E"r 2 1 |0'(r i )-^(* J -)|(t J --ti-i) 

< 4e + E7r 2 1 e(t i -t J -_i) 

4e + £ (£„_! - ii) 

< £ (4 + 6 - a) . 
This implies that 



£(4 + 6-a)</ |0| < L* > + e(4 + 6-o). (6.22) 
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If we now let t\ approach o and t n -i approach b, we get 



,6 

L*-e(4 + 6-a) < / |</>'| < L^ + e (4 + b - a) , (6.23) 

J a 

which completes the proof, since e is arbitrary. 

Exercise 6.8 

a. Take care of the piecewise case in the preceding theorem. 

b. Take care of the case when L^ is infinite in the preceding theorem. 

We now have all the ingredients necessary to define the length of a smooth curve. 

Definition 6.4: 

Let C be a piecewise smooth curve in the plane. The length or arc lengthL = L (C) of C is defined 
by the formula 

L (C) = L^ = su P L P , (6.24) 

p 

where <j) is any parameterization of C. 

If z and w are two points on a piecewise smooth curve C, we will denote by L(z,w) the arc 
length of the portion of the curve between z and w. 

6.6: 

REMARK According to Theorem 6.3, p. 159 and Theorem 6.4, p. 161, we have the following 
formula for the length of a piecewise smooth curve: 

\<l>'(t)\dt, (6.25) 

J a 

where <j) is any parameterization of C. 

It should come as no surprise that the length of a curve C from Z\ to zi is the same as the length 
of that same curve C, but thought of as joining z 2 to Z\. Nevertheless, let us make the calculation 
to verify this. If <f> : [a, b] — » C is a parameterization of this curve from z\ to z 2 , then we have seen 
in part (f) of exercise 6.1 that tp : [a, b] — » C, defined by tp (t) = <p (a + b — t) , is a parameterization 
of C from Z2 to z\. We just need to check that the two integrals giving the lengths are equal. Thus, 

b t'b t'b t'b 

\^'(t)\dt= / \(/>'(a + b-t)(-l)\dt= / \<f>' {a+b-t)\dt= \</> (s)\ds, (6.26) 

J a J a J a 

where the last equality follows by changing variables, i.e., setting t = a + b — s. 

We can now derive the formula for the circumference of a circle, which was one of our main 
goals. TRUMPETS? 

Theorem 6.5: 

Let C be a circle of radius r in the plane. Then the length of C is 2irr. 
Proof: 

Let the center of the circle be denoted by (h, k) . We can parameterize the top half of the circle by 
the function <j> on the interval [0, n] by <j> (t) = h + rcos (t) + i(k + rsin (£)) . So, the length of this 
half circle is given by 

/•7T /*7T fK 

L= \<f>'(t)\dt= \ - rsin (t)+ir cos (t)\dt= rdt = irr. (6.27) 

Jo Jo Jo 

The same kind of calculation would show that the lower half of the circle has length nr, and hence 

the total length is 2irr. 
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The integral formula for the length of a curve is frequently not much help, especially if you really want 
to know how long a curve is. The integrals that show up are frequently not easy to work out. 

Exercise 6.9 

a. Let C be the portion of the graph of the function y = x 2 between x = and x = 1. Let 
<j> : [0, 1] — ► C be the parameterization of this curve given by <p(t) = t + t 2 i. Find the length 
of this curve. 

b. Define <f> : [— 0, n] — > C by (j> (t) = acos (t) + ibsin (t) . What curve does <fi parameterize, and 
can you find its length? 



6.4 Integration with Respect to Arc Length 4 

We introduce next what would appear to be the best parameterization of a piecewise smooth curve, i.e., a 
parameterization by arc length. We will then use this parameterization to define the integral of a function 
whose domain is the curve. 

Theorem 6.6: 

Let C be a piecewise smooth curve of finite length L joining two distinct points z\ to z^. Then 
there exists a parameterization 7 : [0, L] — > C for which the arc length of the curve joining 7 (t) to 
7 (u) is equal to \u — t\ for alH < u s [0, L] . 
Proof: 

Let 4> : [a, b] — > C be a parameterization of C. Define a function F : [a, b] — » [0, L] by 

F(t)= J \<f>' (s)\ds. (6.28) 

•J a 

In other words, F (t) is the length of the portion of C that joins the points z\ = <f>(a) and 
<j> (t) . By the Fundamental Theorem of Calculus, we know that the function F is continuous on 
the entire interval [a, b] and is continuously differentiable on every subinterval (£j_i,ij) of the 
partition P determined by the piecewise smooth parameterization (j>. Moreover, F' (t) = \<f (t)\ > 
for all t g (ti-i,ti) , implying that F is strictly increasing on these subintervals. Therefore, if 
we write s, = F (ti) , then the Sj's form a partition of the interval [0, L] , and the function F : 
(ti-i,ti) — > (sj_i,s») is invertible, and its inverse F~ l is continuously differentiable. It follows then 
that 7 = <j> o F^ 1 : [0, L] — > C is a parameterization of C. The arc length between the points 7 (£) 
and 7 (u) is the arc length between <j> (F -1 (£)) and (F -1 (u)) , and this is given by the formula 

i^\^(s)\ds = /r 1(u) i0'wid*-/r 1(t) i^wi^ 



F (F- 1 («)) - F (F- 1 0)) (6.29) 



which completes the proof. 



Corollary 6.2: 

If 7 is the parameterization by arc length of the preceding theorem, then, for all t e (s»-i, s») , we 
have I7' (s) J = 1. 



This content is available online at <http://cnx.Org/content/m36228/l.2/>. 
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Proof: 

We just compute 



|7 (a) | = \(<j>oF-i)'(s)\ 

= \^(F-i( S ))(F^)'( S )\ 

= \^> { f ^( s )\\ f'(f-h s )) \ ( 6 - 3 °) 

i, 



as desired. 



We are now ready to make the first of our three definitions of integral over a curve. This first one is 
pretty easy. 

Suppose C is a piecewise smooth curve joining z\ to zi of finite length L, parameterized by arc length. 
Recall that this means that there is a 1-1 function 7 from the interval [0, L] onto C that satisfies the 
condidition that the arc length betweenthe two points 7 (t) and 7 (s) is exactly the distance between the 
points t and s. We can just identify the curve C with the interval [0, L] , and relative distances will correspond 
perfectly. A partition of the curve C will correspond naturally to a partition of the interval [0, L] . A step 
function on the dcurve will correspond in an obvious way to a step function on the interval [0, L] , and the 
formula for the integral of a step function on the curve is analogous to what it is on the interval. Here are 
the formal definitions: 

Definition 6.5: 

Let C be a piecewise smooth curve of finite length L joining distinct points, and let 7 : [0, L] — » C 
be a parameterization of C by arc length. By a partition of C we mean a set {z n , Z\, ..., z n } of points 
on C such that Zj = 7 (tj) for all j, where the points {to < t\ < ... < t n } form a partition of the 
interval [0,L] . The portions of the curve between the points Zj-\ and Zj, i.e., the set r y(tj-i,tj) , 
are called the elements of the partition. 

A step fucntion on C is a real-valued function h on C for which there exists a partition 
{z ,zi, ..., z n } of C such that h (z) is a constant o^ on the portion of the curve between Zj-i 
and Zj. 

Before defining the integral of a step function on a curve, we need to establish the usual consistency 
result, encountered in the previous cases of integration on intervals and integration over geometric sets, the 
proof of which this time we put in an exercise. 

Exercise 6.10 

Suppose h is a function on a piecewise smooth curve of finite length L, and assume that there 
exist two partitions {zo, Zi, ■■■, z n } and {u>o, toi, •••, w m } of C such that h (z) is a constant a^ on the 
portion of the curve between Zk-i and Zk, and h(z) is a constant bj on the portion of the curve 
between Wj-i and Wj. Show that 

n m 

y^ j g k L(zk-i,Zk) = y^bjL(wj-i,Wj) . (6.31) 

fe=i j=i 

HINT: Make use of the fact that h o 7 is a step function on the interval [0, L] . 
Now we can make the definition of the integral of a step function on a curve. 
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Definition 6.6: 

Let h be a step function on a piecewise smooth curve C of finite length L. The integral, with 
respect to arc length of h over C is denoted by J c h (s) ds, and is defined by 

/n 
h (s) ds = y^ a jL {zj-i, Zj) , (6.32) 

c i=i 

where {zo,zi, ...,z n } is a partition of C for which h(z) is the constant Oj on the portion of C 
between Zj-i and Zj. 

Of course, integrable functions on C with respect to arc length will be defined to be functions that are 
uniform limits of step functions. Again, there is the consistency issue in the definition of the integral of an 
integrable function. 

Exercise 6.11 

a. Suppose {h n } is a sequence of step functions on a piecewise smooth curve C of finite length, 
and assume that the sequence {h n } converges uniformly to a function /. Prove that the 
sequence {J c h n (s) ds} is a convergent sequence of real numbers. 

b. Suppose {h n } and {k n } are two sequences of step functions on a piecewise smooth curve C 
of finite length I, and that both sequences converge uniformly to the same function /. Prove 
that 



lim / h n (s) ds = lim I k n (s) ds. (6.33) 

J c J c 

Definition 6.7: 

Let C be a piecewise smooth curve of finite length L. A function / with domain C is called 
integrable with respect to arc length on C if it is the uniform limit of step functions on C. 

The integral with respect to arc length of an integrable function / on C is again denoted by 
J c f (s) ds, and is defined by 



/ (s) ds = lim / h n (s) ds, (6.34) 

c J c 

where {h n } is a sequence of step functions that converges uniformly to / on C. 

In a sense, we are simply identifying the curve C with the interval [0, L] by means of the 1-1 parameterizing 
function 7. The next theorem makes this quite plain. 

Theorem 6.7: 

Let C be a piecewise smooth curve of finite length L, and let 7 be a parameterization of C by arc 
length. If / is an integrable function on C, then 

f(s)ds= f f(j(t))dt. (6.35) 

c Jo 

Proof: 

First, if h is a step function on C, let {zj} be a partition of C for which h (z) is a constant a,j on 
the portion of the curve between Zj-i and Zj. Let {tj} be the partition of [0, L] for which Zj = 7 (tj) 
for every j. Note that ft o 7 is a step function on [0, L] , and that h o 7 (i) = Oj for all t € (tj-i,tj) . 
Then, 



J c h(s)ds = Y. j =iO- 3 L{z 3 ^ 1 



■>"3) 



E"=iOj^(7(*i-i).7(*j)) (636 s 

J2]=i a j (tj - tj-l) 

J ho- t (t) dt, 
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which proves the theorem for step functions. 

Finally, if / = limh n is an integrable function on C, then the sequence {h n o 7} converges uniformly to 

/ o 7 on [0, L] , and so 

J c f (s) ds = limj „h n (s) ds 

= Umtfh n (j(t))dt (6.37) 

= /„'/ (7 (*))<**, 

where the final equality follows from Theorem 5.6, p. 127. Hence, Theorem 6.7, p. 166 is proved. 

Although the basic definitions of integrable and integral, with respect to arc length, are made in terms of 
the particular parameterization 7 of the curve, for computational purposes we need to know how to evaluate 
these integrals using different parameterizations. Here is the result: 

Theorem 6.8: 

Let C be a piecewise smooth curve of finite length L, and let <j> : [a, b] — > C be a parameterization 
of C. If / is an integrable function on C. Then 

f f(s)ds= f f(<f>(t))\<l>'(t)\dt. (6.38) 

J C Ja 

Proof: 

Write 7 : [0, L] — > C for a parameterization of C by arc length. As in the proof to Theorem 6.7, 
p. 166, we write g : [a, b] — > [0, L] for 7 _1 o </>. Just as in that proof, we know that g is a piecewise 
smooth function on the interval [a, b] . Hence, recalling that I7' (t) \ = 1 and g (t) > for all but a 
finite number of points, the following calculation is justified: 

J c f(s)ds = I Q L f(l(t))dt 

Iof(l(t))W(t)\dt 
= j b a f{l{g(u)))\i{g(u))\g{u)du 

= j b J{ 1 {g{u)))\i{g{u))\\g{u)\du (6.39) 

= j'f(4>(u))W(g(u))g(u)\du 
= L f {(/> {u)) \( gamma o g) (u)\du 
fj{4>{u))W{u)\du, 
as desired. 

Exercise 6.12 

Let C be the straight line joining the points (0, 1) and (1, 2) . 

a. Find the arc length parameterization 7 : [0, v2~] — * C. 

b. Let / be the function on this curve given by / (x, y) = x 2 y. Compute J „f (s) ds. 

c. Let / be the function on this curve that is defined by f(x,y) is the distance from (x,y) to 
the point (0,3) . Compute J f (s) ds. 

The final theorem of this section sums up the properties of integrals with respect to arc length. There are 
no surprises here. 

Theorem 6.9: 

Let C be a piecewise smooth curve of finite length L, and write / (C) for the set of all functions 
that are integrable with respect to arc length on C. Then: 
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1. / (C) is a vector space ovr the real numbers, and 



(af (s) + bg (s)) ds = a f{s)ds + b g{s) ds (6.40) 

C J c J c 

for all f,g & I (C) and all a,b e R. 

2. (Positivity) If / (z) > for all z £ C, then J c f (s) ds > 0. 

3. If / <= J (C) , then so is |/|, and \J c f (s) ds\ < J c \f (s) | ds. 

4. If / is the uniform limit of functions /„, each of which is in 1(C) , then / s / (C) and 
/ c / (s) rfs = limj c f n (s) ds. 

5. Let {«„} be a sequence of functions in I (C) , and suppose that for each n there is a number m n , 
for which \u n (z) \ < m n for all z € C, and such that the infinite series Yl rn n converges. Then 
the infinite series J2u n converges uniformly to an integrable function, and J c J2u n (s) ds = 

J2Ic u n( s ) ds - 
Exercise 6.13 

a. Prove the preceding theorem. Everything is easy if we compose all functions on C with the 
parameterization 7, obtaining functions on [0, L] , and then use Theorem 5.6, p. 127. 

b. Suppose C is a piecewise smooth curve of finite length joining z\ and z%. Show that the 
integral with respect to arc length of a function / over C is the same whether we think of C 
as being a curve from z\ to zi or, the other way around, a curve from zi to z\. 

6.7: 

REMARK Because of the result in part (b) of the preceding exercise, we speak of "integrating 
over C" when we are integrating with respect to arc length. We do not speak of "integrating from 
Z\ to Z2," since the direction doesn't matter. This is in marked contrast to the next two kinds of 
integrals over curves that we will discuss. 

here is one final bit of notation. Often, the curves of interest to us are graphs of real-valued 
functions. If g : [a, b] — > R is a piecewise smooth function, then its graph C is a piecewise smooth 
curve, and we write J ra hf >/ (5) ds for the integral with respect to arc length of / over C = 
graph (5) . 

6.5 Contour Integrals 5 

We discuss next what appears to be a simpler notion of integral over a curve. In this one, we really do regard 
the curve C as a subset of the complex plane as opposed to two-dimensional real space; we will be integrating 
complex-valued functions; and we explicitly think of the parameterizations of the curve as complex-valued 
functions on an interval [a, b] . Also, in this definition, a curve C from z\ to z-i will be distinguished from its 
reverse, i.e., the same set C thought of as a curve from z^ to z\. 

Definition 6.8: 

Let C be a piecewise smooth curve from z\ to z^ in the plane C, parameterized by a (complex- 
valued) function </> : [a, b] — » C. If / is a continuous, complex- valued function on C, The contour 
integral of f from z\ to z-i along C will be denoted by J c f (Q d( or more precisely by J c z z \f (C) dC,, 
and is defindd by 



?J{QdC= / /(<£(*))</> (t)dt. (6.41) 

c 



5 This content is available online at <http://cnx.Org/content/m36230/l.2/>. 
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6.8: 

REMARK There is, as usual, the question about whether this definition depends on the param- 
eterization. Again, it does not. See the next exercise. 

The definition of a contour integral looks very like a change of variables formula for integrals. 
See Theorem 5.11, Integration by Substitution, p. 133 and part (e) of Exercise 5.22. This is an 
example of how mathematicians often use a true formula from one context to make a new definition 
in another context. 

Notice that the only difference between the computation of a contour integral and an integral 
with respect to arc length on the curve is the absence of the absolute value bars around the factor 
<f> (t) . This will make contour integrals more subtle than integrals with respect to arc length, just 
as conditionally convergent infinite series are more subtle than absolutely convergent ones. 

Note also that there is no question about the integrability of / (cp (t)) 4> (t) , because of Exer- 
cise 5.22. / is bounded, </>' is improperly-integrable on (a, b) , and therefore so is their product. 

Exercise 6.14 

a. State and prove the "independence of parameterization" result for contour integrals. 

b. Prove that 

/ ?J(OdC=-[ ?J(Od(. (6.42) 

J c J c 

Just remember how to parameterize the curve in the opposite direction. 

c. Establish the following relation between the absolute value of a contour integral and a corre- 
sponding integral with respect to arc length. 

/(CMC|< / \f{s)\ds. (6.43) 

c J c 

Not all the usual properties hold for contour integrals, e.g., like those in Theorem 6.9, p. 167 above. The 
functions here, and the values of their contour integrals, are complex numbers, so all the properties of 
integrals having to do with positivity and inequalities, except for the one in part (c) of Exercise 6.14, no 
longer make any sense. However, we do have the following results for contour integrals, the verification of 
which is just as it was for Theorem 6.9, p. 167. 

Theorem 6.10: 

Let C be a piecewise smooth curve of finite length joining Z\ to z 2 - Then the contour integrals of 
continuous functions on C have the following properties. 

1. If / and g are any two continuous functions on C, and o and b are any two complex numbers, 
then 



(af(() + bg(())d( = a f (()<% + b g (<) d(. (6.44) 

c J c J c 

2. If / is the uniform limit on C of a sequence {/„} of continuous functions, then J c f (£) d£ = 
limj c f n (C) d(. 

3. Let {u n } be a sequence of continuous functions on C, and suppose that for each n there is 
a number m n , for which \u n (z) \ < m n for all z € C, and such that the infinite series £m„ 
converges. Then the infinite series Yl u n converges uniformly to a continuous function, and 

| c £MC)dC = £/ c MCMC 

In the next exercise, we give some important contour integrals, which will be referred to several times in 
the sequel. Make sure you understand them. 

Exercise 6.15 

Let c be a point in the complex plane, and let r be a positive number. Let C be the curve 
parameterized by <f> : [—tt, tt — e] : C defined by <j> (t) = c + re lt = c + rcos (t) + irsin (t) . For each 
integer n e Z, define /„ (z) = (z — c) n . 
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1. What two points z\ and z 2 does C join, and what happens to z 2 as e approaches 0? 

2. Compute J c f n (C) d£ f° r an integers n, positive and negative. 

3. What happens to the integrals computed in part (b) when e approaches 0? 



4. Set e = ir, and compute J c f n (C) <^C f° r an integers 

5. Again, set e = ir. Evaluate 



n. 



cos (C — c) , f sin (C — c) , 

— -^ d( and / ^ '- d(. (6.45) 

cC-c J c C-c 

HINT: Make use of the infinite series representations of the trigonometric functions. 



6.6 Vector Fields, Differential Forms, and Line Integrals 6 

We motivate our third definition of an integral over a curve by returning to physics. This definition is very 
much a real variable one, so that we think of the plane as R 2 instead of C. A connection between this real 
variable definition and the complex variable definition of a contour integral will emerge later. 

Definition 6.9: 

By a vector field on an open subset U of R 2 , we mean nothing more than a continuous function 

V (x,y) = (P(x,y) ,Q(x,yj) from U into R 2 . The functions P and Q are called the components 
of the vector field V ■ 

We will also speak of smooth vector fields, by which we will mean vector fields V both of whose 
component functions P and Q have continuous partial derivatives 

tialP tialP tialQ tialQ 

r~ i r~ i —and — (6.46) 

tialx tialy tialx tialy 

on U. 

6.9: 

The idea from physics is to think of a vector field as a force field, i.e., something that exerts a force 

at the point (x,y) with magnitude | V (x,y) | and acting in the direction of the vector V (x,y) ■ 
For a particle to move within a force field, "work" must be done, that is energy must be provided to 
move the particle against the force, or energy is given to the particle as it moves under the influence 
of the force field. In either case, the basic definition of work is the product of force and distance 
traveled. More precisely, if a particle is moving in a direction u within a force field, then the work 
done on the particle is the product of the component of the force field in the direction of u and 
the distance traveled by the particle in that direction. That is, we must compute dot products of 
the vectors V (x,y) and u (x,y) . Therefore, if a particle is moving along a curve C, parameterized 
with respect to arc length by 7 : [0,L] — ■> C, and we write 7 (t) = (x (t) ,y(t)) , then the work 

W (21, z-i) done on the particle as it moves from z\ = 7 (0) to zi = 7 (L) within the force field V, 
should intuitively be given by the formula 

W( Zl ,z 2 ) = J L <Vh(t))\j'(t)> dt 

= f L P(x(t),y(t))x(t) + Q(x(t),y(t))y(t)dt (6.47) 

= f c P dx + Qdy, 

where the last expression is explicitly defining the shorthand notation we will be using. 



6 This content is available online at <http://cnx.Org/content/m36232/l.2/>. 
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The preceding discussion leads us to a new notion of what kind of object should be "integrated" over a 
curve. 

Definition 6.10: 

A differential form on a subset U of R 2 is denoted by u> = Pdx + Qdy, and is determined by two 
continuous real- valued functions P and Q on U. We say that u> is bounded or uniformly continuous 
if the functions P and Q are bounded or uniformly continuous functions on U. We say that the 
differential form ui is smooth of order k if the set U is open, and the functions P and Q have 
continuous mixed partial derivatives of order k. 

If to = Pdx + Qdy is a differential form on a set U, and if C is any piecewise smooth curve of 
finite length contained in U, then we define the line integral J c u> of w over C by 



u>= / Pdx + Qdy = P(-y(t))x (t) + Q('y(t))y (t) dt, (6.48) 

c J c Jo 

where 7 (t) = (x (t) , y (t)) is a parameterization of C by arc length. 

6.10: 

REMARK There is no doubt that the integral in this definition exists, because P and Q are 
continuous functions on the compact set C, hence bounded, and 7' is integrable, implying that 
both x and y are integrable. Therefore P (7 (£)) x (t) + Q (7 (£)) y' (£) is integrable on (0, L) . 

These differential forms u> really should be called "differential 1-forms." For instance, an example 
of a differential 2-form would look like R dxdy, and in higher dimensions, we could introduce notions 
of differential forms of higher and higher orders, e.g., in 3 dimension things like P dxdy + Q dzdy + 
Rdxdz. Because we will always be dealing with R 2 , we will have no need for higher order differential 
forms, but the study of such things is wonderful. Take a course in Differential Geometry! 

Again, we must see how this quantity J „u> depends, if it does, on different parameterizations. 
As usual, it does not. 

Exercise 6.16 

Suppose u> = Pdx + Qdy is a differential form on a subset U of R 2 . 

a. Let C be a piecewise smooth curve of finite length contained in U that joins z\ to zi- Prove 
that 

uj= Pdx + Qdy= P{4>{t))x {t) + Q{4>{t))y (t) dt (6.49) 

C J C J a 

for any parameterization <p : [a, b] — > C having components x (t) and y (t) . 

b. Let C be as in part (a), and let C denote the reverse of C, i.e., the same set C but thought 



of as a curve joining z-i to z\. Show that J ^.ui = —J 



u>. 



C 

c 

c. Let C be as in part (a). Prove that 

\f Pdx + Qdy\<{M P + M Q )L, (6.50) 

J c 

where Mp and Mq are bounds for the continuous functions \P\ and \Q\ on the compact set 
C, and where L is the length of C. 

Example 6.1 

The simplest interesting example of a differential form is constructed as follows. Suppose U is an 
open subset of R 2 , and let / : U — » R be a differentiable real- valued function of two real variables; 
i.e., both of its partial derivatives exist at every point (x,y) € U. (See the last section of Chapter 
IV.) Define a differential form uj = df, called the differential of /, by 

tialf tialf 

df = —— dx + —— dy, (6.51) 

tialx tialy 
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i.e., P = tialf/tialx and Q = tialf/tialy. These differential forms df are called exact differential 
forms. 

6.11: 

REMARK Not every differential form u> is exact, i.e., of the form df. Indeed, determining which 
w's are df's boils down to what may be the simplest possible partial differential equation problem. 
If lo is given by two functions P and Q, then saying that u> = df amounts to saying that / is a 
solution of the pair of simultaneous partial differential equations 

t ^l = Pand t ^l = Q . (6 .52) 

tialx tialy 

See part (b) of the exercise below for an example of a nonexact differential form. 

Of course if a real-valued function / has continuous partial derivatives of the second order, then Theo- 
rem 4.22, Theorem on mixed partials, p. 112 tells us that the mixed partials f xy and f yx must be equal. So, 
if ui = Pdx + Qdy = df for some such /, Then P and Q would have to satisfy tialP/ tialy = tialQ /tialx. 
Certainly not every P and Q would satisfy this equation, so it is in fact trivial to find examples of differential 
forms that are not differentials of functions. A good bit more subtle is the question of whether every differ- 
ential form Pdx + Qdy, for which tialP/tialy = tialQ/tialx, is equal to some df. Even this is not true in 
general, as part (c) of the exercise below shows. The open subset U on which the differential form is defined 
plays a significant role, and, in fact, differential forms provide a way of studying topologically different kinds 
of open sets. 

In fact, although it may seem as if a differential form is really nothing more than a pair of functions, 
the concept of a differential form is in part a way of organizing our thoughts about partial differential 
equation problems into an abstract mathematical context. This abstraction is a good bit more enlightening 
in higher dimensional spaces, i.e., in connection with functions of more than two variables. Take a course in 
Multivariable Analysis! 

Exercise 6.17 

a. Solve the pair of simultaneous partial differential equations 

Half tialf 

— — = x + y and — = x — y. (6.53) 

tialx tialy 

b. Show that it is impossible to solve the pair of simultaneous partial differential equations 

tialf tialf o 

— = x + y and — = y . (6.54) 

tialx tialy 

Hence, conclude that the differential form iv = (x + y) dx + y 3 dy is not the differential df of 
any real-valued function /. 

c. Let U be the open subset of R 2 that is the complement of the single point (0, 0) . Let P (x, y) = 
—y/(x 2 + y 2 ) and Q(x,y) = x/(x 2 + y 2 ). Show that tialPj tialy = tialQ /tialx at every 
point of U, but that uj = Pdx + Qdy is not the differential df of any smooth function / on 
U. HINT: If P were f x , then / would have to be of the form / (x, y) = —tan -1 (x/y) + g (y) , 
where g is some differentiable function of y. Show that if Q = f y then g (y) is a constant c. 
Hence, f (x,y) must be —tan" 1 (x/y) + c. But this function / is not continuous, let alone 
differentiable, at the point (1, 0) . Consider limf (1, 1/n) and limf (1, — 1/n) . 

The next thing we wish to investigate is the continuity of J„u> as a function of the curve C. This brings out 
a significant difference in the concepts of line integrals versis integrals with respect to arc length. For the 
latter, we typically think of a fixed curve and varying functions, whereas with line integrals, we typically 
think of a fixed differential form and variable curves. This is not universally true, but should be kept in 
mind. 
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Theorem 6.11: 

Let u> = Pdx + Qdy be a fixed, bounded, uniformly continuous differential form on a set U in R 2 , 
and let C be a fixed piecewise smooth curve of finite length L, parameterized by </> : [a, b] — » C, that 

is contained in U. Then, given an e > there exists a 5 > such that, for any curve C contained 

in U,\J c lo — J \lo\ < e whenever the following conditions on the curve C hold: 

c 

1. C is a piecewise smooth curve of finite length L contained in U, parameterized by <fi: [a, b] —>C ■ 

2. |0 0) - (f> 0) I < 6 for all t £ [a, b] . 

3. J b a W(t)-4> (t)\dt<5. 

Proof: 

Let e > be given. Because both P and Q are bounded on U, let Mp and Mq be upper bounds 
for the functions |P| and \Q\ respectively. Also, since both P and Q are uniformly continuous 
on U, there exists a S > such that if | (c, d) — (c',d) | < S, then \P (c, d) — P (c, d) \ < e/AL 
and \Q(c,d) — Q (c,d) \ < e/AL. We may also choose this 5 to be less than both e/AMp and 

e/AMq. Now, suppose C is a curve of finite length L, parameterized by <p: [a,b] — >C, and that 

\<t> (t) - <j>(t)\<8 for all t e [a, b] , and that J b \<f> (t) - <j> (t)\<6. Writing (j> (t) = (x (t) , y (t)) and 

4>{t)= (x(t),y (t) J , we have 

< \J c Pdx + Qdy- f~Pdx + Qdy\ 

c 

\tfp(^(t))x'(t)-pL(t))x (t) + Q(0(t))t/'(t)-QU(t)J» (t)dt\ 

< Si \P (<f> (*)) *' (*) - P f (*) J * (*) I * + fa 10 (<t> (*)) y (*) - (<£ (*) J » (*) I dt 

< J b a \P (0 (t)) -PU (t) ) Hx (t) | d* + / a 6 |P (0 (t) ) H* (i) - i (t) | d* 



+ i! 10 OK*)) - U (*) ) Ms/* (t) I d< + f b a 10 U (t) j lb' (t) -v (t)\ dt 

< -t J b \x (t) \dt + M P S b a \x (t) - x (t)\ dt 

+ f E J b a \y'(t)\dt + M Q Jl\y(t)-y (t)\dt 

< IE la $ (*) \dt + M P J b a |0' (t) - <f> (t) j dt 

+ IE /a W (*) I * + M Q /a I*' (t) " (t) I dt 

< f + | + M P( 5 + M Q< 5 



(6.55) 



as desired. 
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Again, we have a special notation when the curve C is a graph. If g : [a, b] — > R is a piecewise smooth 
function, then its graph C is a piecewise smooth curve, and we write J ra h , sP dx + Q dy for the line integral 
of the differential form Pdx + Qdy over the curve C = graph (g) . 

As alluded to earlier, there is a connection between contour integrals and line integrals. It is that a single 
contour integral can often be expressed in terms of two line integrals. Here is the precise statement. 

Theorem 6.12: 

Suppose C is a piecewise curve of finite length, and that / = u + iv is a complex- valued, continuous 
function on C. Let <f> : [a, b] — > C be a parameterization of C, and write cf>(t) = x (t) + iy (t) . Then 



Proof: 

We just compute: 



f(Qd(= {Udx-vdy)+ {vdx + udy). (6.56) 



IcfiOdC = J^f(<f>(t))<f>'(t)dt 

= j h a (u (</> (*)) + iv (<f, (*))) {x (t) + iy' (t)) dt 

f b a (u(4>(t))x'(t)-v(4>(t))y'(t)) 

+ i(v{4> (£)) x (t) + u{4> (t)) y (t)) dt (6.57) 

j b a (u^(t))x(t)-v^(t))y{t))dt 
+ iJa(v(4>(t))x'(t) + u(4>(t))y'(t)) dt 
= I c u dx — v dy + if c v dx + u dy, 



as asserted. 



6.7 Integration Around Closed Curves, and Green's Theorem 7 

Thus far, we have discussed integration over curves joining two distinct points Z\ and z^. Very important 
in analysis is the concept of integrating around a closed curve, i.e., one that starts and ends at the same 
point. There is nothing really new here; the formulas for all three kinds of integrals we have defined will look 
the same, in the sense that they all are described interms of some parameterization <f>. A parameterization 
4> : [a, b] — > C of a closed curve C is just like the parameterization for a curve joining two points, except that 
the two points <f> (a) and <j> (b) are equal. 

Two problems are immediately apparent concerning integrating around a closed curve. First, where do 
we start on the curve, which point is the initial point? And second, which way to we go around the curve? 
Recall tha if <fi : [a, b] — > C is a parameterization of C, then tp : [a, b] — > C, defined by ip(t) = <p(a + b — t) , 
is a parameterization of C that is the reverse of <fi, i.e., it goes around the curve in the other direction. If 
we are integrating with respect to arc length, this reverse direction won't make a difference, but, for contour 
integrals and line integrals, integrating in the reverse direction will introduce a minus sign. 

The first question mentioned above is not so difficult to handle. It doesn't really matter where we start 
on a closed curve; the parameterization can easily be shifted. 

Exercise 6.18 

Let <f>\a, b] — > R 2 be a piecewise smooth function that is 1-1 except that <p(a) = <j>(b) . For each 

< c < b - a, define <fi: [a+ c,b+ c] : R 2 by <f> (t) = 4> (t) for a + c < t < b, and <f> (t) = 4> (t - b + a 
for b < t < b + c. 



7 This content is available online at <http://cnx.Org/content/m36233/l.2/>. 
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1. Show that is a piecewise smooth function, and that the range C of <f> coincides with the 

range of <fi . 

2. Let / be an integrable (with respect to arc length) function on C. Show that 

f(<t>(t))\<f>'(t)\dt = I /(0(t)W(t)|di. (6.58) 



That is, the integral J c f (s) ds of / with respect to arc length around the closed curve C is 
independent of where we start. 

3. Let / be a continuous complex-valued function on C. Show that 

f(<f>(t))<f>'(t)dt = J f U (t)U (t) dt. (6.59) 

That is, the contour integral J c f (Q d( of / around the closed curve C is independent of 
where we start. 

4. Let u> = Pdx + Qdy be a differential form on C. Prove that 

P (</> (t))x (t) + Q{4> (t))y (t)dt= J pU (t) \x (t) + Q ( <j> (t) j y (t) dt. (6.60) 

That is, the line integral J „w of lo around C is independent of where we start. 

The question of which way we proceed around a closed curve is one that leads to quite intricate and difficult 
mathematics, at least when we consider totaly general smooth curves. For our purposes it wil, suffice to 
study a special kind of closed curve, i.e., curves that are the boundaries of piecewise smooth geometric sets. 
Indeed, the intricate part of the general situation has a lot to do with determining which is the "inside" of 
the closed curve and which is the "outside," a question that is easily settled in the case of a geometric set. 
Simple pictures make this general question seem silly, but precise proofs that there is a definite inside and 
a definite outside are difficult, and eluded mathematicians for centuries, culminating in the famous Jordan 
Curve Theorem, which asserts exactly what our intuition predicts: 

Theorem 6.13: Jordan Curve Theorem 

The complement of a closed curve is the union of two disjoint components, one bounded and one 
unbounded. 

We define the bounded component to be the inside of the curve and the unbounded component to be the 
outside. 

We adopt the following convention for how we integrate around the boundary of a piecewise smooth 
geometric set S. That is, the curve C$ will consist of four parts: the lower boundary (graph of the lower 
bounding function I), the righthand boundary (a portion of the vertical line x = b), the upper boundary 
(the graph of the upper bounding function u), and finally the lefthand side (a portion of the vertical line 
x = a). By integrating around such a curve Cg, we will always mean proceeding counterclockwise around 
the curves. Specifically, we move from left to right along the lower boundary, from bottom to top along 
the righthand boundary, from right to left across the upper boundary, and from top to bottom along the 
lefthand boundary. Of course, as shown in the exercise above, it doesn't matter where we start. 

Exercise 6.19 

Let S be the closed piecewise smooth geometric set that is determined by the interval [a, b] and 
the two piecewise smooth bounding functions u and /. Assume that the boundary Cs of S has 
finite length. Suppose the graph of u intersects the lines x = a and x = b at the points (a, c) and 
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(b, d) , and suppose that the graph of I intersects those lines at the points (a, e) and (b, f) . Find a 
parameterization <f> : [a , b\ — ► Cs of the curve Cs- 

HINT: Try using the interval [a,b + d— f + b— a + c — e] as the domain [a , b'~\ of <fi. 

The next theorem, though simple to state and use, contains in its proof a combinatorial idea that is truly 
central to all that follows in this chapter. In its simplest form, it is just the realization that the line integral 
in one direction along a curve is the negative of the line integral in the opposite direction. 

Theorem 6.14: 

Let Si, ..., S n be a collection of closed geometric sets that constitute a partition of a geometric set 
S, and assume that the boundaries of all the Si's, as well as the boundary of S, have finite length. 
Suppose wis a continuous differential form on all the boundaries {Cs k }- Then 



c s f.=l Cs <. 



w. (6.61) 



Proof: 

We give a careful proof for a special case, and then outline the general argument. Suppose then 
that S is a piecewise smooth geometric set, determined by the interval [a, b] and the two bounding 
functions u and I, and assume that the boundary Cs has finite length. Suppose m (a;) is a piecewise 
smooth function on [a,b] , satisfying J \m'\ < oo, and assume that I (x) < m(x) < u(x) for all 
x g (a, b) . Let Si be the geometric set determined by the interval [a, b] and the two bounding 
functions m and I, and let S^ be the geometric set determined by the interval [a, b] and the two 
bounding functions u and m. We note first that the two geometric sets Si and S2 comprise a 
partition of the geometric set S, so that this is indeed a pspecial case of the theorem. 

Next, consider the following eight line integrals: First, integrate from left to write along the 
graph of m, second, up the line x = b from (b,m(b)) to (b,u(b)) , third, integrate from right to 
left across the graph of u, fourth, integrate down the line x = a from (a, u (a)) to (a, m (a)) , fifth, 
continue down the line x = a from (a, m (a)) to (a, I (a)) , sixth, integrate from left to right across 
the graph of I, seventh, integrate up the line x = b from (6, / (6)) to (b, m (b)) , and finally, integfrate 
from right to left across the graph of m. 

The first four line integrals comprise the line integral around the geometric set S2, and the last 
four comprise the line integral around the geometric set Si. On the other hand, the first and eighth 
line integrals here cancel out, for one is just the reverse of the other. Hence, the sum total of these 
eight line integrals, integrals 2-7, is just the line integral around the boundary Cs of S. Therefore 

/ oj= uj+ lo (6.62) 

as desired. 

We give next an outline of the proof for a general partition Si,...,S n of S. Let Sk be determined by the 
interval [ofe, 6fc] an d the two bounding functions u^ and If.. Observe that, if the boundary Cs k of Sk intersects 
the boundary Cs- of Sj in a curve C, then the line integral of u> along C, when it is computed as part of 
integrating counterclockwise around Sk, is the negative of the line integral along C, when it is computed as 
part of the line integral counterclockwise around Sj. Indeed, the first line integral is the reverse of the second 
one. (A picture could be helpful.) Consequently, when we compute the sum of the line integrals of u> around 
the Cs k 's, All terms cancel out except those line integrals that ar computed along parts of the boundaries 
of the Sfc's that intersect no other Sj. But such parts of the boundaries of the S^'s must coincide with parts 
of the boundary of S. Therefore, the sum of the line integrals of u> around the boundaries of the S^'s equals 
the line integral of u> around the boundary of S, and this is precisely what the theorem asserts. 
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Exercise 6.20 

Prove the analog of Theorem 6.14, p. 176 for contour integrals: Let Si, ...,S n be a collection of 
closed geometric sets that constitute a partition of a geometric set S, and assume that the boundaries 
of all the Si's, as well as the boundary of S, have finite length. Suppose / is a continuous complex- 
valued function on all the boundaries {Cs k } as well as on the boundary Cs- Then 



/„ 



/ (C) dC = ]T / / (C) d(. (6.63) 



s fe=i 

We come now to the most remarkable theorem in the subject of integration over curves, Green's Theorem. 
Another fanfare, please! 

Theorem 6.15: Green 

Let S be a piecewise smooth, closed, geometric set, let Cs denote the closed curve that is the 
boundary of S, and assume that Cs is of finite length. Suppose u) = Pdx + Qdy is a continuous 
differential form on S that is smooth on the interior S° of S. Then 

/ 0,= / Pd X + Qdy= f tpQ_pi. ^.^ 

J Cs J Cs J S tmlx tlal V 

6.12: 

REMARK The first thing to notice about this theorem is that it connects an integral around 
a (1-dimensional) curve with an integral over a (2-dimensional) set, suggesting a kind of connec- 
tion between a 1-dimensional process and a 2-dimensional one. Such a connection seems to be 
unexpected, and it should therefore have some important implications, as indeed Green's Theorem 
does. 

The second thing to think about is the case when ui is an exact differential df of a smooth 
function / of two real variables. In that case, Green's Theorem says 

tialf tialf f 

dx + 777T7. d V = / \fvx ~ fxy) , (6.65) 



Cs tialx tialy J s 

which would be equal to if / € C 2 (S) , by Theorem 4.22, Theorem on mixed partials, p. 112. 
Hence, the integral of df around any such curve would be 0. If U is an open subset of R 2 , there 
may or may not be some other ui's, called closed differential forms, having the property that their 
integral around every piecewise smooth curve of finite length in U is 0, and the study of these closed 
differential forms u> that are not exact differential forms df has led to much interesting mathematics. 
It turns out that the structure of the open set U, e.g., how many "holes" there are in it, is what's 
important. Take a course in Algebraic Topology! 

The proof of Green's Theorem is tough, and we break it into several steps. 

Lemma 6.1: 

Suppose S is the rectangle [a, b] x [c, d] . Then Green's Theorem is true. 
Proof: 

We think of the closed curve Cs bounding the rectangle as the union of four straight lines, C\,C^, C3 
and C4, and we parameterize them as follows: Let <f> : [a, b] — > C\ be defined by <f> (t) = (t, c) ; let <j) : 
[b, b + d - c] -> C 2 be defined by 4> (t) = {b, t - b + c) ; let <f> : [b + d - c, b + d - c + b - a] -> C 3 be 
defined by <f> (t) = (b + d — c + b — t, d) ; and let <f> : [b+ d — c+ b — a,b + d— c+ b — a+ d — c] — > 
C4 be defined by <j>(t) = (a, b + d — c + b — a + d — t) . One can check directly to see that this <f> 
parameterizes the boundary of the rectangle S = [a, b] x [c, d] . 
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As usual, we write cf> (t) = (x (t) ,y (£)) . Now, we just compute, use the Fundamental Theorem 
of Calculus in the middle, and use part (d) of Exercise 5.30 at the end. 

= J c Pdx + Qdy + J c Pdx + Qdy 

+ f C3 Pdx + Qdy + l c Pdx + Q dy 
J b a P^(t))x'(t) + Q^(t))y'(t)dt 
+ C d - C P {4> (*)) x (t) +Q(cf> (*)) y (t) dt 

+ Ctc + "~ a P (* (*)) *' (*) + Q i<t> (*)) y (*) di 
+ / 6 +l7+6-; +d_C P (<£ (*)) *' (t) + (^ (i)) y' (t) rfi 

J a h P (t, c) rft + / 6 6+d_c Q{b,t-b + c) dt (6.66) 

+ Stttc +h ~ a P(b+d-c+b-t,d)(-l)dt 
+ Cd-c+ta +d ~ C Q(a,b + d-c+b-a+d-t)(-l)dt 
J b a P(t lC )dt+J c d Q(b,t)dt 
-J b a P(t 1 d)dt-J c d Q(a,t)dt 

it (Q ( b > t)-Q (o, 0) di " /a ( p (*> rf ) " P ( f . c )) dt 



proving the lemma. 



/*( 



tialy 

tialQ _ tialP 

tialx tialy ' 



Lemma 6.2: 

Suppose 5 is a right triangle whose vertices are of the form (a,c) , (6, c) and (b,d) . Then Green's 
Theorem is true. 
Proof: 

We parameterize the boundary Cs of this triangle as follows: For t € [a, b] , set <f>(t) = (t,c) ; 
for t G [6, 6 + d — c] , set <f> (t) = (b,t + c—b); and for t G [6 + d — c, 6 + d — c + b — a] , set (p (t) = 
(b + d— c+b— t,b+d — c+d — t). Again, one can check that this <fi parameterizes the boundary 
of the triangle S. 
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Write 4> (t) = (x (t) , y (tj) . Again, using the Fundamental Theorem and Exercise 5.30, we have 



fcs" 



J r P dx + Q dy 



^P^{t))x(t) + Q(rb(t))y(t)dt 
+ J b b+d - c P (4> (t)) x (t) + Q{4> (t)) y (t) dt 

+ Ctc +b ~ a p W (*)) * (*) + Q (<£ (*)) y (*) di 

J a fc P (i, c) rft + J b b+d ~ c Q(b,t + c-b) dt 

rb-\-d—c-\-b—a 
+ h+d-c 



+ Ii 



b+d—c+b—a 
b+d-c 



P(b+d-c+b-t,b+d-c+d- t) (-1) dt 
Q(6 + d-c + 6-t,6 + d-c + d-t)(-l) eft 

J*P(t,c)dt + J d Q(b,t)dt 

-I b a P(^(d+^ b (c-d)))ds 

-f c d Q(b+^ d (a-b)),sd S 
S c d (Q(b,s)-Q((b+^j(a-b)), S )) ds 

-f b a(p(s,(d+^ b (c-d)))-P(s,c))ds 

IcIb+i=4(a-b)Tni§(^ s ) dtds 



(6.67) 



rd rb 

'b+^^{a-b) tial. 

i(c-d) u a ip 



rb rd+\ 
Ja Jc 



tialy 



(s, i) dtds 



r ( tialQ _ tialP 
J S \ tialx tialy ' 



which proves Lemma 6.2, p. 178. 



Lemma 6.3: 

Suppose Si, ...,S n is a partition of the geometric set S, and that the boundary Cs k has finite length 
for all 1 < k < n. If Green's Theorem holds for each geometric set Sk, then it holds for S. 
Proof: 

From Theorem 6.14, p. 176 we have 



fc=i 



E ' a. " 



and from Theorem 5.24, p. 148 we have 



/^cx -ty — 2_j I ^°Cx -*i 
s k=1 J s k 



Since Green's Theorem holds for each k, we have that 

Wx ^y; 



Cs h 



S k 



and therefore 



as desired. 



U — I Qx — Py, 

C s J S 



(6.68) 



(6.69) 



(6.70) 



(6.71) 



Exercise 6.21 
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a. Prove Green's Theorem for a right triangle with vertices of the form (a, c) , (b, c) , and (a, d) . 

b. Prove Green's Theorem for a trapezoid having vertices of the form (a, c) , (6, c) ,(b, d) , and 
(a, e) , where both d and e are greater than c. HINT: Represent this trapezoid as the union 
of a rectangle and a right triangle that share a border. Then use Lemma 6.3, p. 179. 

c. Prove Green's Theorem for S any quadrilateral that has two vertical sides. 

d. Prove Green's Theorem for any geometric set S whose upper and lower bounding functions 
are piecewise linear functions. HINT: Show that S can be thought of as a finite union of 
quadrilaterals, like those in part (c), each one sharing a vertical boundary with the next. 
Then, using induction and the previous exercise finish the argument. 

We need one final lemma before we can complete the general proof of Green's Theorem. This one is where 
the analysis shows up; there are carefully chosen e's and S's. 

Lemma 6.4: 

Suppose S is contained in an open set U and that u> is smooth on all of U. Then Green's Theorem 
is true. 
Proof: 

Let the piecewise smooth geometric set S be determined by the interval [a, b] and the two bounding 
functions u and I. Using Theorem 2.11, p. 45, choose an r > such that the neighborhood 
N r (S) C U. Now let e > be given, and choose delta to satisfy the following conditions: 

a. 5 < r/2, from which it follows that the open neighborhood N$ (S) is a subset of the compact 
set N r/2 {S) . (See part (f) of Exercise 2.24.) 

b. 5 < e/4M, where Mis a common bound for all four continuous functions \P\, \Q\, \P y \, and 
\Q X \ on the compact set N r / 2 (S) . 

c. 5 < e/4M{b- a) . 

d. 5 satisfies the conditions of Theorem 6.11, p. 173. 

Next, using Theorem 6.1, p. 154, choose two piecewise linear functions p u and pi so that 

1. \u (x) - p u (x) | < (5/2 for all x G [a, b] . 

2. \l (x) - pi (x) | < 5/2 for all x G [a, b] . 

3 - la \ u ' ( X )-Pu( x )\ dx < $■ 

4 - la \l'{x)-p\{x)\dx<S. 

Let S be the geometric set determined by the interval [a, b] and the two bounding functions u and 

I , where u= p u + 5/2 and 1= pi — 5/2. We know that both u and I are piecewise linear functions. 
We have to be a bit careful here, since for some x's it could be that p u (x) < pi (x) . Hence, we 

could not simply use p u and pi themselves as bounding functions for S ■ We do know from (1) and 

(2) that u (x) < u (x) and / (x) > I (x) , which implies that the geometric set S is contained in 

the geometric set S ■ Also S is a subset of the neighborhood Ng (s) , which in turn is a subset of 
the compact set N r / 2 {S) . 

Now, by part (d) of the preceding exercise, we know that Green's Theorem holds for S ■ That is 

u= L{Qx-Py)- (6.72) 
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We will show that Green's Theorem holds for S by showing two things: (i) \J „ to— J „ u>\ < 4e, and 

s 

(ii) \J s (Qx) — Py — J - (Qx — Py) \ < £■ We would then have, by the usual adding and subtracting 

s 
business, that 

I / w - / (Qx ~ P y ) | < 5e, (6.73) 

J C s J S 

and, since e is an arbitrary positive number, we would obtain 



w= / (Qx--P w ). (6-74) 

C s J S 

Let us estabish (i) first. We have from (1) above that \u (x) — u (x) \ < S for all x e [a, b] , and 
from (3) that 

b ~' f-b 

\u (x) - u (x)\dx = I \u (x) - p u (x) | dx < 6. (6.75) 

•J a 

Hence, by Theorem 6.11, p. 173, 

w- / /~\w| < £• (6.76) 

graph(u) J graph [ 

Similarly, using (2) and (4) above, we have that 

l/ w-/ /^\w|<e. (6.77) 

J graph(i) J graph j I 



Also, the difference of the line integrals of w along the righthand boundaries of S and 5 is less 
than e. Thus 



6,u(6) 

l/cS'^ - /c } ~ < -I = I /,?? ^ *) di - /" w ( & ' *) di l 

M(&)| *(*>) 



< i/j;g ) Q(6 ) t)dti + i/r ) Q(6 ) t)(fti 

< mI|J(6)-J(6)| + |«(6)-«(6)|J 

< M(5 + <J) 

2M(5 

< £. 
Of course, a similar calculation shows that 



a, I (a) 



(6.78) 



/j*-/ c - n W|<& ^ 
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These four line integral inequalities combine to give us that 

u> - / u>\ < 4e, (6.80) 

Cs J C„ 

s 

establishing (i). 

Finally, to see (ii), we just compute 

o < \J~(Qv-Px)-J s (Qv-Px)\ 

s 

= I la .P W (Q* ((*. S ) ~ P V (*. S )) dsdt - la Im ®* (*> S ) " P V (*' S )) <*"**! 

= I la l m (Qx (*. *) " ^ (*, *)) d*<ft + la /„"(!? (0, (*, «) " ^ (*. *)) d*dt| f g gl) 

< 2MU b a \l{t)-l (t)\ + \u(t)-u(t)\dt 

< AM5 {b - a) 

< e. 
This establishes (ii), and the proof is complete. 

At last, we can finish the proof of this remarkable result. 
Theorem 6.16: PROOF OF GREEN'S THEOREM 

Proof: 

As usual, let S be determined by the interval [a, b] and the two bounding functions u and I. Recall 
that u (x) — I (x) > for all x € (a, b) . For each natural number n > 2, let S n be the geometric 
set that is determined by the interval [a + l/n,b— 1/n] and the two bounding functions u n and 
l n , where u n = u — (u — I) /n restricted to the interval [a + 1/n, b — 1/n] , and l n = I + (u — I) /n 
restricted to [a + 1/n, b — 1/n] . Then each S n is a piecewise smooth geometric set, whose boundary 
has finite length, and each S n is contained in the open set S° where by hypothesis u> is smooth. 
Hence, by Lemma 4, Green's Theorem holds for each S n . Now it should follow directly, by taking 
limits, that Green's Theorem holds for S. In fact, this is the case, and we leave the details to the 
exercise that follows. 

Exercise 6.22 

Let S, u>, and the S n 's be as in the preceding proof. 

a. Using Theorem 6.11, p. 173, show that 

/ uj = lim u>. (6.82) 

J Cs J Cs n 

b. Let / be a bounded integrable function on the geometric set S. Prove that 

f = lim[ f. (6.83) 

s J s n 

c. Complete the proof to Green's Theorem; i.e., take limits. 
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6.13: 

REMARK Green's Theorem is primarily a theoretical result. It is rarely used to "compute" a 
line integral around a curve or an integral of a function over a geometric set. However, there is 
one amusing exception to this, and that is when the differential form u> = x dy. For that kind of u>, 
Green's Theorem says that the area of the geometric set S can be computed as follows: 

w-fS-hmt-h." 1 '- (M4) 

This is certainly a different way of computing areas of sets from the methods we developed earlier. Try 
this way out on circles, ellipses, and the like. 
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Chapter 7 

The Fundamental Theorem of Algebra, 
and The Fundamental Theorem of 
Analysis 



7.1 The Fundamental Theorem of Algebra, and the Fundamental 
Theorem of Analysis 1 

In this chapter we will discover the incredible difference between the analysis of functions of a single complex 
variable as opposed to functions of a single real variable. Up to this point, in some sense, we have treated 
them as being quite similar subjects, whereas in fact they are extremely different in character. Indeed, if / 
is a differentiable function of a complex variable on an open set U C C, then we will see that / is actually 
expandable in a Taylor series around every point in U. In particular, a function /of a complex variable is 
guaranteed to have infinitely many derivatives on U if it merely has the first one on U. This is in marked 
contrast with functions of a real variable. See part (3) of Theorem 4.17, p. 104. 
The main points of this chapter are: 

1. The Cauchy-Riemann Equations (Theorem 7.1, Cauchy-Riemann equations, p. 186), 

2. Cauchy's Theorem (Theorem 7.3, Cauchy's Theorem, Fundamental Theorem of Analysis, p. 188), 

3. Cauchy Integral Formula (Theorem 7.4, Cauchy Integral Formula, p. 189), 

4. A complex-valued function that is differentiable on an open set is expandable in a Taylor 
series around each point of the set (Theorem 7.5, p. 192), 

5. The Identity Theorem (Theorem 7.6, Identity Theorem, p. 194), 

6. The Fundamental Theorem of Algebra (Theorem 7.7, Fundamental Theorem of Algebra, p. 195), 

7. Liouville's Theorem (Theorem 7.8, Liouville, p. 196), 

8. The Maximum Modulus Principle (corollary to Corollary 7.4, Maximum Modulus Principle, p. 
198), 

9. The Open Mapping Theorem (Theorem 7.10, Open Mapping Theorem, p. 198), 

10. The uniform limit of analytic functions is analytic (Theorem 7.12, p. 201), and 

11. The Residue Theorem (Theorem 7.17, Residue Theorem, p. 206). 



lr This content is available online at <http://cnx.Org/content/m36234/l.2/>. 
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7.2 Cauchy's Theorem 2 

We begin with a simple observation connecting differentiability of a function of a complex variable to a 
relation among of partial derivatives of the real and imaginary parts of the function. Actually, we have 
already visited this point in Exercise 4.8. 

Theorem 7.1: Cauchy-Riemann equations 

Let / = u+ iv be a complex- valued function of a complex variable z = x + iy = (x,y) , and suppose 
/ is differentiable, as a function of a complex variable, at the point c = (a, b) . Then the following 
two partial differential equations, known as the Cauchy-Riemann Equations, hold: 

tialu tialv 

{a,b) = ——(a,b), (7.1) 



and 



tialx tialy 



tialu , ,. tialv , ,, ,„ „. 

—(a,b) = i-(a,b). 7.2 

tialy y ' ; tialx y ' ; v ' 

Proof: 

We know that 

/ c = (jmft -* , (7.3) 

h 

and this limit is taken as the complex number h approaches 0. We simply examine this limit for 
real h's approaching and then for purely imaginary h's approaching 0. For real h's, we have 

/'(c) = f(a + ib) 

= i im f(a+h+ib)-f(a+ib) 

h^O h 

= Umh : Q u ( a + h ' b )+ iv ( a + h < b )- u ( a ' b )- iv ( a < b ) (7.4) 

= hm I — ^-^ + ilim I — ^- L ^ L 

For purely imaginary h's, which we write as h = ik, we have 
/'(c) = f(a + ib) 

= i im f(a+i(b+k))-f(a+ib) 

_ i ■ u{a.b+k)+iv{a.b+k) — u{a.b) — iv{a,b) /— ^\ 



-ilim 



u(a,b-\-k) — u{a,b) . v(a 7 b-\-k) — v(a : b) 



fc^O k k 

-i^(^) + ^(a,b). 

Equating the real and imaginary parts of these two equivalent expressions for /' (c) gives the 
Cauchy-Riemann equations. 

As an immediate corollary of this theorem, together with Green's Theorem (Theorem 6.15, Green, p. 
177), we get the following result, which is a special case of what is known as Cauchy's Theorem. 



2 This content is available online at <http://cnx.Org/content/m36235/l.2/>. 
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Corollary 7.1: 

Let S be a piecewise smooth geometric set whose boundary Cs has finite length. Suppose / is a 
complex- valued function that is continuous on S and differentiable at each point of the interior S° 
of S. Then the contour integral /„ / (Q d( = 0. 

Exercise 7.1 

a. Prove the preceding corollary. See Theorem 6.12, p. 174. 

b. Suppose / = u + iv is a differentiable, complex- valued function on an open disk B r (c) in C, 
and assume that the real part u is a constant function. Prove that / is a constant function. 
Derive the same result assuming that v is a constant function. 

c. Suppose / and g are two differentiable, complex-valued functions on an open disk B r (c) in 
C. Show that, if the real part of / is equal to the real part of g, then there exists a constant 
k such that f (z) = g (z) + k, for all z € B r (c) . 

For future computational purposes, we give the following implications of the Cauchy-Riemann equations. As 
with Theorem 7.1, Cauchy-Riemann equations, p. 186, this next theorem mixes the notions of differentiability 
of a function of a complex variable and the partial derivatives of its real and imaginary parts. 

Theorem 7.2: 

Let / = u + iv be a complex-valued function of a complex variable, and suppose that / is differ- 
entiable at the point c = (a, b) . Let A be the 2x2 matrix 

A = (u x (a, b) v x (a, b) u y (a, b) v y (a, b)) . (7.6) 

Then: 

1. \f(c)\ 2 = det(A). 

2. The two vectors 

Vi = (u x (a, b) , u y (a, b)) and V2 = {v x (a, b) , v y (a, b)) (7.7) 

are linearly independent vectors in R 2 if and only if /' (c) 7^ 0. 

3. The vectors 

V3 = (u x (a, b) , v x (a, b)) and Vi = (u y (a, b) , v y (a, b)) (7.8) 

are linearly independent vectors in R 2 if and only if /' (c) 7^ 0. 

Proof: 

Using the Cauchy-Riemann equations, we see that the determinant of the matrix A is given by 



detA = u x (a, b) v y (a, b) — u y (a, b) v x (a, b) 

= (u x (a,b)) +(v x (a,b)) 

= (u x (a,b) +iv x (a,b))(u x (a,b) -iv x (a,b)) (7.9) 



/ (c) /' (c) 
l/'(c)| 2 , 



proving part (1). 



The vectors Vi and V2 are the columns of the matrix A, and so, from elementary linear algebra, we see 
that they are linearly independent if and only if the determinant of A is nonzero. Hence, part (2) follows 
from part (1). Similarly, part (3) is a consequence of part (1). 

It may come as no surprise that the contour integral of a function / around the boundary of a geometric 
set S is not necessarily if the function / is not differentiable at each point in the interior of S. However, it 



188 



CHAPTER 7. THE FUNDAMENTAL THEOREM OF ALGEBRA, AND THE 

FUNDAMENTAL THEOREM OF ANALYSIS 



is exactly these kinds of contour integrals that will occupy our attention in the rest of this chapter, and we 
shouldn't jump to any conclusions. 

Exercise 7.2 

Let c be a point in C, and let S be the geometric set that is a closed disk B r (c) . Let <j> be the 
parameterization of the boundary C r of S given by <f> (t) = c + re lt for t € [0, 2n] . For each integer 
n € Z, define /„ (z) = (z — c) . 



a. Show that J„ f n ((d( = for all n / — 1. 

b. Show that 



/-i (0 d( 



, r c- 



d( = 2-7TZ. 



(7.10) 



There is a remarkable result about contour integrals of certain functions that aren't differentiable everywhere 
within a geometric set, and it is what has been called the Fundamental Theorem of Analysis, or Cauchy's 
Theorem. This theorem has many general statements, but we present one here that is quite broad and 
certainly adequate for our purposes. 

Theorem 7.3: Cauchy's Theorem, Fundamental Theorem of Analysis 

Let S be a piecewise smooth geometric set whose boundary Cs has finite length, and let S^ S° 
be a piecewise smooth geometric set, whose boundary C~ also is of finite length. Suppose / is 

s 

continuous on S f) S , i.e., at every point z that is in S but not in S , and assume that / is 

differentiable on S° S, i.e., at every point z in S° but not in S ■ (We think of these sets as being 
the points "between" the boundary curves of these geometric sets.) Then the two contour integrals 
/c s / (0 d C and Ic\ f (0 d C are equal. 

s 

Proof: 

Let the geometric set S be determined by the interval [a, b] and the two bounding functions u 



and I, and let the geometric set S be determined by the subinterval 



a, b 



of [a, b] and the two 



bounding functions u and I . Because gC S°, we know that u (t) < u(t) and I (t) < I (t) for all 



t e 



a, b 



We define four geometric sets Si, ..., S4 as follows: 



1. Si is determined by the interval 
that interval. 

2. S2 is determined by the interval 
to that interval. 

3. S3 is determined by the interval 
that interval. 

4. S4 is determined by the interval 
that interval. 



a, a 



a, b 



a, b 



b,b 



and the two bounding functions u and I restricted to 
and the two bounding functions u and u restricted 

and the two bounding functions I and I restricted to 

and the two bounding functions u and I restricted to 



Observe that the five sets S, Si, ..., S4 constitute a partition of the geometric set S. The corollary 
to Theorem 7.1, Cauchy-Riemann equations, p. 186 applies to each of the four geometric sets 



189 

Si, ..., S4. Hence, the contour integral of / around each of the four boundaries of these geometric 
sets is 0. So, by Exercise 6.20, 

Ic s f(0dC = JcJiOdt + ZUfcsJiOdt 

s (7.11) 

IcJiQdC, 

s 

as desired. 

Exercise 7.3 

a. Draw a picture of the five geometric sets in the proof above and justify the claim that the 
sum of the four contour integrals around the geometric sets Si, ..., S4 is the integral around 
Cs minus the integral around C ~ . 

s 

b. Let Si, ...,S„ be pairwise disjoint, piecewise smooth geometric sets, each having a boundary 

of finite length, and each contained in a piecewise smooth geometric set S whose boundary 
also has finite length. Prove that the S^'s are some of the elements of a partition {Si} of 
S, each of which is piecewise smooth and has a boundary of finite length. Show that, by 

reindexing, Si, ...,S n can be chosen to be the first n elements of the partition {Si}- HINT: 
Just carefully adjust the proof of Theorem 5.25, p. 148. 

c. Suppose S is a piecewise smooth geometric set whose boundary has finite length, and let 
Si, ..., S n be a partition of S for which each S& is piecewise smooth and has a boundary Cs k 
of finite length. Suppose / is continuous on each of the boundaries Cs k of the Sfc's as well as 
the boundary Cs of S, and assume that / is continuous on each of the Sfe's, for 1 < k < m, 
and differentiable at each point of their interiors. Prove that 



/(C)dC= E / /(0<*C- (7.12) 



Cs fe=m+l Cs k 

Prove the following generalization of the Cauchy Theorem: Let Si, ..., S n be pairwise disjoint, 
piecewise smooth geometric sets whose boundaries have finite length, all contained in the 
interior of a piecewise smooth geometric set S whose boundary also has finite length. Suppose 
/ is continuous at each point of S that is not in the interior of any of the Sfe's, and that / is 
differentiable at each point of S° that is not an element of any of the S^'s. Prove that 

n ~ 

f(()d( = J2 /(OdC- (7-13) 

°s k=l Cs k 

Perhaps the main application of Theorem 7.3, Cauchy's Theorem, Fundamental Theorem of Analysis, p. 188 
is what's called the Cauchy Integral Formula. It may not appear to be useful at first glance, but we will be 
able to use it over and over throughout this chapter. In addition to its theoretical uses, it is the basis for a 
technique for actually evaluating contour integrals, line integrals, as well as ordinary integrals. 

Theorem 7.4: Cauchy Integral Formula 

Let S be a piecewise smooth geometric set whose boundary Cs has finite length, and let / be a 
continuous function on S that is differentiable on the interior S° of S. Then, for any point z E S°, 
we have 

f(z) = ^-f PQ<K. (7.14) 
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REMARK This theorem is an initial glimpse at how differentiable functions of a complex variable 
are remarkably different from differentiable functions of a real variable. Indeed, Cauchy's Integral 
Formula shows that the values of a differentiable function / at all points in the interior of a 
geometric set S are completely determined by the values of that function on the boundary of the 
set. The analogous thing for a function of a real variable would be to say that all the values of a 
differentiable function / at points in the open interval (a, b) are completely determined by its values 
at the endpoints a and b. This is patently absurd for functions of a real variable, so there surely is 
something marvelous going on for differentiable functions of a complex variable. 
Proof: 

Let r be any positive number such that B r (z) is contained in the interior S° of S, and note that the 

close disk B r (z) is a piecewise smooth geometric set S contained in S°. We will write C r instead of 
C~ for the boundary of this disk, and we will use as a parameterization of the curve C r the function 

S 
<f> : [0, 2n] — ► C r given by <f> (£) = z + re lt . Now the function g(Q = f (C) / (C ~~ z ) ls continuous on 

~o ~ 

SC\S and differentiable on S°P\S, so that Theorem 7.3, Cauchy's Theorem, Fundamental Theorem 
of Analysis, p. 188 applies to the function g. Hence 

^-Jc s iBdC = ^-J Cs g(0dC 



2-ni J() z-\-re zz —z 



= &Cf(* + re«)dt. 

Since the equality established above is valid, independent of r, we may take the limit as r goes to 
0, and the equality will persist. We can evaluate such a limit by replacing the r by 1/n, in which 
case we would be evaluating 

-1 fllV / 1 \ 1 p2ir 



'! - ^oo 2?T 



,2* 


f 1 »\ 




1 


/ 


z+-e lt ) 


i dt = 


lim — 


1 


\ n J 




n— >oo 2lT 



lim — / f[z+ -e lt dt= lim — f n (t) dt, (7.16) 



o 



where /„ (t) = f (z + fraclne lt ) . Finally, because the function / is continuous at the point z, it 
follows that the sequence {/„} converges uniformly to the constant function / (z) on the interval 
[0,27r] . So, by Theorem 5.6, we have that 



1 r2ir -, i-1-n 



lim— f n (t)dt=— f(z)dt=f(z). (7.17) 

Therefore, 

J-f IiL dC = l im J- f"f(z + re it )dt = f(z), (7.18) 

and the theorem is proved. 

The next exercise gives two simple but strong consequences of the Cauchy Integral Formula, and it would 
be wise to spend a few minutes deriving other similar results. 

Exercise 7.4 

a. Let S and / be as in the preceding theorem, and assume that / (z) = for every point on 
the boundary Cs of S. Prove that f (z) = for every z € S. 
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b. Let S be as in part (a), and suppose that / and g are two continuous functions on S, both 
differentiable on S°, and such that / (£) = g (£) for every point on the boundary of S. Prove 
that f (z) = g (z) for all z £ S. 

The preceding exercise shows that two differentiable functions of a complex variable are equal everywhere 
on a piecewise smooth geometric set S if they agree on the boundary of the set. More is true. We will see 
below in the Identity Theorem that they are equal everywhere on a piecewise smooth geometric set S if they 
agree just along a single convergent sequence in the interior of S. 

Combining part (b) of Exercise 7.3, Exercise 6.20, and Theorem 7.3, Cauchy's Theorem, Fundamental 
Theorem of Analysis, p. 188, we obtain the following corollary: 

Corollary 7.2: 

Let Si, ...,S n be pairwise disjoint, piecewise smooth geometric sets whose boundaries have finite 
length, all contained in the interior of a piecewise smooth geometric set S whose boundary has 
finite length. Suppose / is continuous at each point of S that is not in the interior of any of the 
Sfe's, and that / is differentiable at each point of S° that is not an element of any of the Sfe's. Then, 
for any z £ S° that is not an element of any of the S^'s, we have 




/W = S3 T^Z d C-}l f^-dC ■ (7-19) 



Proof: 

Let r > be such that B r (z) is disjoint from all the Sfc's. By part (b) of Exercise 7.3, let 7\, ..., T m 
be a partition of S such that Tf. = Sk for 1 < k < n, and T„ +1 = B r (z) . By Exercise 6.20, we 
know that 

[ liO dC = yf liO dC (7 . 2 o) 



fe=l" °T fc 



From the Cauchy Integral Formula, we know that 

/(C) 



C-z 



d( = 2TTif{z). (7.21) 



Also, since / (£) / (Q — z) is differentiable at each point of the interior of the sets Tk for k > n + 1, 
we have from Theorem 7.2, p. 187 that for all k > n + 1 

f(C) 
f-^dC = 0. (7.22) 

Ct k C " z 
Therefore, 

7^- rf C = E I P^d(+ 2-Kif (z) , (7.23) 

c s Q-z k=1 Jc Sk Q-z 

which completes the proof. 

Exercise 7.5 

Suppose S is a piecewise smooth geometric set whose boundary has finite length, and let c\, ..., c n 
be points in S°. Suppose / is a complex- valued function that is continuous at every point of S 
except the CVs and differentiable at every point of S° except the Cfe's. Let r\,...,r n be positive 
numbers such that the disks {Bn k (cfe)} are pairwise disjoint and all contained in S°. 
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a. Prove that 



/ f(()d(,= J2[ /(C)dC (7-24) 



where C& denotes the boundary of the disk B rk (cfc) . 
b. For any z € S° that is not in any of the closed disks B rk (ck) , show that 



f W- dC = 2 „if(z) + Y i f ^dC. (7.25) 

J Cs^~ Z k=1 J C fc <> - z 

c. (c) Specialize part (b) to the case where S = B r (c) , and / is analytic at each point of B r (c) 
except at the central point c. For each z / c in B r (c) , and any < 6 < \z — c\, derive the 
formula 

2tt«7 c r (- z 2m J c 5 C- z 



7.3 Basic Applications of the Cauchy Integral Formula 3 

As a major application of the Cauchy Integral Formula, let us show the much alluded to remarkable fact that 
a function that is a differentiable function of a complex variable on an open set U is actually expandable in 
a Taylor series around every point in U, i.e., is an analytic function on U. 

Theorem 7.5: 

Suppose / is a differentiable function of a complex variable on an open set U C C, and let c be 
an element of U. Then / is expandable in a Taylor series around c. In fact, for any r > for which 
B r (c) C U, we have 

oo 

f{z) = Y J *n{z-c) n (7.27) 

n=0 

for all z € B r (c) . 
Proof: 

Choose an r > such that the closed disk B r (c) C U, and write C r for the boundary of this disk. 
Note that, for all points £ on the curve C r , and any fixed point z in the open disk B r (c) , we have 
that \z — c| <r= |£ — c|, whence |z — c\/\C, — c\ = \z — c\/r < 1. Therefore the geometric series 

oo / \ n -i 

Yl ( t^t~ ) conver s es to i z -c - ( 7 - 28 ) 

„=o V' c / - 1 c-c 

Moreover, by the Weierstrass M-Test, as functions of the variable £, this infinite series converges 
uniformly on the curve C r . We will use this in the calculation below. Now, according to Theorem 7.4, 



3 This content is available online at <http://cnx.Org/content/m36237/l.2/>. 
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Cauchy Integral Formula, p. 189, we have that 



JL f /(O dC 
&Jc r ^l£Lo (?=§)> (7-29) 

~~ 2-iriJ C r l^n=0 (C-c) n + 1 ^ ' ^ 

- J_V°° f /(C) ( z -c) n dC 

— 2iri l^n=0 J C r (£_ c )»+i ^ > ^ 

n=0 a„{z-c) , 

where we are able to bring the summation sign outside the integral by part (3) of Theorem 6.10, 
p. 169, and where 

This proves that / is expandable in a Taylor series around the point c, as desired. 

Using what we know about the relationship between the coefficients of a Taylor series and the derivatives of 
the function, together with the Cauchy Integral Theorem, we obtain the following formulas for the derivatives 
of a differentiable function / of a complex variable. These are sometimes also called the Cauchy Integral 
Formulas. 

Corollary 7.3: 

Suppose / is a differentiable function of a complex variable on an open set U, and let c be an 
element of U. Then / is infinitely differentiable at c, and 

f (n)^ ™ ! f /(O 



fW (c) = — / ^^^rr d(, (7.31) 

for any piecewise smooth geometric set S C U whose boundary Cs has finite length, and for which 
c belongs to the interior S° of S. 

Exercise 7.6 

a. Prove the preceding corollary. 

b. Let /, U, and c be as in Theorem 7.5, p. 192. Show that the radius of convergence r of the 
Taylor series expansion of / around c is at least as large as the supremum of all s for which 
B s (c) C U. 

c. Conclude that the radius of convergence of the Taylor series expansion of a differentiable 
function of a complex variable is as large as possible. That is, if / is differentiable on a disk 
B r (c) , then the Taylor series expansion of / around c converges on all of B r (c) . 

d. Consider the real-valued function of a real variable given by / (x) = 1/ (l + x 2 ) . Show that / 
is differentiable at each real number x. Show that / is expandable in a Taylor series around 0, 
but show that the radius of convergence of this Taylor series is equal to 1. Does this contradict 
part (c)? 

e. Let / be the complex-valued function of a complex variable given by f (z) = 1/ (l + z 2 ) . 
We have just replaced the real variable x of part (d) by a complex variable z. Explain the 
apparent contradiction that parts (c) and (d) present in connection with this function. 

Exercise 7.7 
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a. Let S be a piecewise smooth geometric set whose boundary Cs has finite length, and let / 
be a continuous function on the curve Cs- Define a function F on S° by 

F(z)= [ P^-d(. (7.32) 

Prove that F is expandable in a Taylor series around each point c & S°. Show in fact that 
F (z) = Yl a n{z — c) n for all z in a disk B r (c) C S°, where 

f(C) 

T d(. (7.33) 



2iriJ Cs{C - c) 

HINT: Mimic the proof of Theorem 7.5, p. 192. 

b. Let / and F be as in part (a). Is F defined on the boundary Cs of S? If z belongs to the 
boundary Cs, and z = limz n , where each z n € S°, Does the sequence {F (z n )} converge, and, 
if so, does it converge to / (z)l 

c. Let S be the closed unit disk B\ (0) , and let / be defined on the boundary C\ of this disk 
by / (z) = z, i.e., / (x + iy) = x — iy. Work out the function F of part (a), and then re-think 
about part (b). 

d. Let / and F be as in part (a). If, in addition, / is continuous on all of S and differentiable on 
S°, show that F (z) = 2irif (z) for all z € 5°. Think about this "magic" constant 2ni. Review 
the proof of the Cauchy Integral Formula to understand where this constant comes from. 

Theorem 3.15, THEOREM 3.14159 (Definition of it), p. 73 and Exercise 3.26 constitute what we called 
the "identity theorem" for functions that are expandable in a Taylor series around a point c. An even stronger 
result than that is actually true for functions of a complex variable. 

Theorem 7.6: Identity Theorem 

Let / be a continuous complex- valued function on a piecewise smooth geometric set S, and assume 
that / is differentiable on the interior S° of S. Suppose {zk} is a sequence of distinct points in S° 
that converges to a point c in 5°. If / (zu) = for every K, then / (z) = for every z € S. 
Proof: 

It follows from Exercise 3.26 that there exists an r > such that / (z) = for all z € B r (c) . Now 
let w be another point in 5°, and let us show that / (w) must equal 0. Using part (f) of Exercise 6.2, 



let 



a, b 



C be a piecewise smooth curve, joining c to w, that lies entirely in S°. Let A be the 



set of all t € 



a, b 



such that / (4>(s)) = for all s s 



a,t\ . We claim first that A is nonempty. 



Indeed, because <j) is continuous, there exists an e > such that \<j> (s) — c\ = \</>(s) — 4> \ a ) | < r 
if \s— a | < e. Therefore f(<f>(s)) = for all s e 



a, a +e ) , whence, a +e € A. Obviously, A 

is bounded above by b, and we write to for the supremum of A. We wish to show that to =b, 
whence, since <f> is continuous at B,f (w) = f I <j> I b I I = /(<A(£o)) = 0. Suppose, by way of 

contradiction, that to < b, and write zq = <f>(to) . Now zq s S°, and zq = lim<j>(to — 1/fc) because 
4> is continuous at to- But / (</> (to — l/k)) = for all k. So, again using Exercise 3.26, we know that 
there exists an r > such that f (z) = for all z € B r < (z ) • As before, because <j> is continuous 

at to, there exists a 5 > such that to + S < b and \<j)(s) — c/>(to) | < r if \s — to\ < 6. Hence, 
/ (4> (s)) = for all s s (to — S, to + 6) , which implies that to + $ belongs to A. But then to could 
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not be the supremum of A, and therefore we have arrived at a contradiction. Consequently, to =b, 
and therefore / (w) = for all w G S°. Of course, since every point in S is a limit of points from 
S°, and since / is continuous on S, we see that / (z) = for all z e S, and the theorem is proved. 

The next exercise gives some consequences of the Identity Theorem. Part (b) may appear to be a contrived 
example, but it will be useful later on. 

Exercise 7.8 

a. Suppose / and g are two functions, both continuous on a piecewise smooth geometric set S 
and both differentiable on its interior. Suppose {zk} is a sequence of elements of S° that 
converges to a point c G S°, and assume that / (zk) = 9 {zk) for all k. Prove that / (z) = g (z) 
for all z G S. 

b. Suppose / is a nonconstant differentiable function defined on the interior of a piecewise smooth 
geometric set S. If c G S° and B e (c) C S°, show that there must exist an < r < e for which 
/ ( c ) / / i z ) f° r & H z on the boundary of the disk B r (c) . 



7.4 The Fundamental Theorem of Algebra 4 

We can now prove the Fundamental Theorem of Algebra, the last of our primary goals. One final trumpet 
fanfare, please! 

Theorem 7.7: Fundamental Theorem of Algebra 

Let p (z) be a nonconstant polynomial of a complex variable. Then there exists a complex number 
zq such that p(zo) = 0. That is, every nonconstant polynomial of a complex variable has a root in 
the complex numbers. 
Proof: 

We prove this theorem by contradiction. Thus, suppose that p is a nonconstant polynomial of 
degree n > 1, and that p(z) is never 0. Set f (z) = l/p(z), and observe that / is defined and 
differentiable at every point z € C. We will show that / is a constant function, implying that 
p = 1// is a constant, and that will give the contradiction. We prove that / is constant by showing 
that its derivative is identically 0, and we compute its derivative by using the Cauchy Integral 
Formula for the derivative. 

From part (4) of Theorem 3.1, p. 57, we recall that there exists a B > such that ■^ L '-|2;| n < 
\p (z) |, for all z for which \z\ > B, and where c„ is the (nonzero) leading coefficient of the polynomial 
p. Hence, |/ (z) \ < Mn for all \z\ > B, where we write M for 2/\c n \. Now, fix a point c G C. Because 
/ is differentiable on the open set U = C, we can use the corollary to Theorem 7.4, Cauchy Integral 
Formula, p. 189 to compute the derivative of / at c by using any of the curves C r that bound the 
disks B r (c) , and we choose an r large enough so that \c + re lt \ > B for all < t < 2n. Then, 



l/MI = \^l!c,7B^dC\ 



< 



2-niJ C r (C-c) 2 
^ I JO ( c+re «- c ) 2 * re m \ 

£- r f Q 2 «\f(c + re«)\dt (7-34) 



< J_ f 27r M ,jt 

— 2-xr JO Ic+re"!" " 



< 



2-kt JO |c+re"| r 

M 



4 This content is available online at <http://cnx.Org/content/m36238/l.2/>. 
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Hence, by letting r tend to infinity, we get that 

M 
\f (c) | < Um — = 0, (7.35) 

r— >oo TJD 

and the proof is complete. 

REMARK 7.1: 

The Fundamental Theorem of Algebra settles a question first raised back in Section 1.1. There, 
we introduced a number I that was a root of the polynomial x 2 + 1. We did this in order to build 
a number system in which negative numbers would have square roots. We adjoined the "number" 
i to the set of real numbers to form the set of complex numbers, and we then saw that in fact 
every complex number z has a square root. However, a fear was that, in order to build a system 
in which every number has an nth root for every n, we would continually need to be adjoining new 
elements to our number system. However, the Fundamental Theorem of Algebra shows that this 
is not necessary. The set of complex numbers is already rich enough to contain all nth roots and 
even more. 

Practically the same argument as in the preceding proof establishes another striking result. 

Theorem 7.8: Liouville 

Suppose / is a bounded, everywhere differentiable function of a complex variable. Then / must 
be a constant function. 

Exercise 7.9 

Prove Liouville's Theorem. 



7.5 The Maximum Modulus Principle 5 

Our next goal is to examine so-called "max/min" problems for coplex- valued functions of complex variables. 
Since order makes no sense for complex numbers, we will investigate max/min problems for the absolute 
value of a complex- valued function. For the corresponding question for real-valued functions of real variables, 
we have as our basic result the First Derivative Test (Theorem 4.8, First Derivative Test for Extreme Values, 
p. 92). Indeed, when searching for the poinhts where a differentiable real- valued function / on an interval 
[a, b] attains its extreme values, we consider first the poinhts where it attains a local max or min, to which 
purpose end Theorem 4.8, First Derivative Test for Extreme Values, p. 92 is useful. Of course, to find the 
absolute minimum and maximum, we must also check the values of the function at the endpoints. 

An analog of Theorem 4.8, First Derivative Test for Extreme Values, p. 92 holds in the complex case, 
but in fact a much different result is really valid. Indeed, it is nearly impossible for the absolute value of a 
differentiable function of a complex variable to attain a local maximum or minimum. 

Theorem 7.9: 

Let / be a continuous function on a piecewise smooth geometric set S, and assume that / is 
differentiable on the interior S° of S. Suppose c is a point in S° at which the real- valued function 
|/| attains a local maximum. That is, there exists an e > such that |/(c) | > \f (z) | for all z 
satisfying \z — c\ < e. Then / is a constant function on S; i.e., / (z) = / (c) for all z € S. In other 
words, the only differentiable functions of a complex variable, whose absolute value attains a local 
maximum on the interior of a geometric set, are constant functions on that set. 
Proof: 

If / (c) = 0, then / (z) = for all z € B £ (c) . Hence, by the Identity Theorem (Theorem 7.6, Identity 
Theorem, p. 194), / (z) would equal for all z £ S. so, we may as well assume that /(c) ^ 0. 
Let r be any positive number for which the closed disk B r (c) is contained in B e (c) . We claim first 



5 This content is available online at <http://cnx.Org/content/m36239/l.2/>. 
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that there exists a point z on the boundary C r of the disk B r (c) for which \f (z) | = \f (c) |. Of 
course, \f (z\ < \f (c) | for all z on this boundary by assumption. By way of contradiction, suppose 
that |/ (C) | < 1/ (c) | for all ( on the boundary C r of the disk. Write M for the maximum value of 
the function |/| on the compact set C r . Then, by our assumption, M < |/(c) |. Now, we use the 
Cauchy Integral Formula: 



/(C) 

. C-c 

r27 r /( c+re ") ; 
re xt 

^C\f(c+re*)\dt 

M 
l/(c)|, 



l/MI = I &/*.££<". I 

< 
< 

< 



(7.36) 



and this is a contradiction. 

Now for each natural number n for which 1/n < e, let z n be a point for which \z n — c\ = l/n and 
|/ (z n ) | = |/ (c) |. We claim that the derivative /' (z n ) of / at z n = for all n. What we know is that 

the real-valued function F (x, y) = \f (x + iy) \ = ( u (x, y) + (v (x, y)) attains a local maximum 

value at z n = (x n ,y n ) . Hence, by Exercise 4.34, both partial derivatives of F must be at (x n , y n ) . 
That is 



and 



tialu tialv 

2m (x n , y„) —— (x n ,y n ) + 2v (x n , y n ) —— (x n , y„) = 
tialx tialx 



tialu tialv 

2u(x n ,y n ) —— {x n ,y n ) +2v(x n ,y n ) —— {x n ,y n ) = 0. 
tialy tialy 



Hence the two vectors 



and 



-» (tialu tialv 

Vl = I T7ZTZ ( x n, Vn) , Tr~r \ x n, Vn , 



\ tialx 



tialx 



Vi = ( 



V 



tialu 



\tialy 



\%ni Vn) 



tialv 
tialy 



V-^n: yr< 



(7.37) 



(7.38) 



(7.39) 



(7.40) 



are both perpendicular to the vector V3 = (u(x n ,y n ) ,v (x n ,y n j) . But V3 7^ 0, because || V3 || = 
|/ (z n ) I = |/ (c) I > 0, and hence Vi and Vi are linearly dependent. But this implies that /' (z n ) = 
0, according to Theorem 7.2, p. 187. 

Since c = limz n , and /' is analytic on 5°, it follows from the Identity Theorem that there exists 
an r > such that /' (z) = for all z € B r (c) . But this implies that / is a constant / (z) = f (c) 
for all z € B r (c) . And thenm, again using the Identity Theorem, this implies that / (z) = / (c) for 
all z g S, which completes the proof. 



7.2: 

REMARK Of course, the preceding proof contains in it the verification that if |/| attains a 
maximum at a point c where it is differentiable, then /' (c) = 0. This is the analog for functions 
of a complex variable of Theorem 4.8, First Derivative Test for Extreme Values, p. 92. But, 
Theorem 7.9, p. 196 certainly asserts a lot more than that. In fact, it says that it is impossible for 
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the absolute value of a nonconstant differentiable function of a complex variable to attain a local 
maximum. Here is the coup d'gras: 

Corollary 7.4: Maximum Modulus Principle 

Let / be a continuous, nonconstant, complex-valued function on a piecewise smooth geometric set 
S, and suppose that / is differentiable on the interior 5° of S. Let M be the maximum value of the 
continuous, real- valued function |/| on S, and let z be a point in S for which \f (z) \ = M. Then, z 
does not belong to the interior S° of 5; it belongs to the boundary of S. In other words, |/| attains 
its maximum value only on the boundary of S. 

Exercise 7.10 

a. Prove the preceding corollary. 

b. Let / be an analytic function on an open set U, and let c e U be a point at which |/| achieves 
a local minimum; i.e., there exists an e > such that \f (c) | < \f (z) | for all z € B 6 (c) . Show 
that, if / (c) j£ 0, then / is constant on B e (c) . Show by example that, if / (c) = 0, then / 
need not be a constant on B e (c) . 

c. Prove the "Minimum Modulus Principle:" Let / be a nonzero, continuous, nonconstant, func- 
tion on a piecewise smooth geometric set S, and let m be the minimum value of the function 
|/| on S. If z is a point of S at which this minimum value is atgtained, then z belongs to the 
boundary Cs of S. 



7.6 The Open Mapping Theorem and the Inverse Function Theorem 6 

We turn next to a question about functions of a complex variable that is related to Theorem 4.10, Inverse 
Function Theorem, p. 95, the Inverse Function Theorem. That result asserts, subject to a couple of 
hypotheses, that the inverse of a one-to-one differentiable function of a real variable is also differentiable. 
Since a function is only differentiable at points in the interior of its domain, it is necessary to verify that 
the point / (c) is in the interior of the domain / (S) of the inverse function / _1 before the question of 
differentiability at that point can be addressed. And, the peculiar thing is that it is this point about / (c) 
being in the interior of / (S) that is the subtle part. The fact that the inverse function is differentiable there, 
and has the prescribed form, is then only a careful e — 5 argument. For continuous real- valued functions of 
real variables, the fact that / (c) belongs to the interior of / (5) boils down to the fact that intervals get 
mapped onto intervals by continuous functions, which is basically a consequence of the Intermediate Value 
Theorem. However, for complex-valued functions of complex variables, the situation is much deeper. For 
instance, the continuous image of a disk is just not always another disk, and it may not even be an open set. 
Well, all is not lost; we just have to work a little harder. 

Theorem 7.10: Open Mapping Theorem 

Let S be a piecewise smooth geometric set, and write U for the (open) interior 5° of S. Suppose 
/ is a nonconstant differentiable, complex-valued function on the set U. Then the range / (U)oi f 
is an open subset of C. 
Proof: 

Let c be in U. Because / is not a constant function, there must exist an r > such that /(c)// (z) 
for all z on the boundary C r of the disk B r (c) . See part (b) of Exercise 7.8. Let zq be a point in 
the compact set C r at which the continuous real- valued function \f (z) — f (c) | attains its minimum 
value s. Since / (z) ^ /(c) for any z € C r , we must have that s > 0. We claim that the disk 
B s /2 (/ (c)) belongs to the range / (U) of /. This will show that the point / (c) belongs to the 
ihnterior of the set / (U) , and that will finish the proof. 



6 This content is available online at <http://cnx.Org/content/m36240/l.2/>. 
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By way of contradiction, suppose B s / 2 (/ (c) is not contained in / (U) ,, and let w e B s / 2 (/ (c)) 
be a complex number that is not in f (U) . We have that \w — f (c) \ < s/2, which implies that 
\w — f (z) \ > s/2 for all z € C r . Consider the function g defined on the closed disk B r (c) by 
g (z) = 1/ (w — f (z)) . Then g is continuous on the closed disk B r (c) and differentiable on B r (c) . 
Moreover, g is not a constant function, for if it were, / would also be a constant function on B r (c) 
and therefore, by the Identity Theorem, constant on all of U, whichg is not the case by hypothesis. 
Hence, by the Maximum Modulus Principle, the maximum value of \g\ only occurs on the boundary 
C r of this disk. That is, there exists a point z € C r such that \g (z) \ < \g (V) | for all z € B r (c) . 
But then 

2 11 11 , 

" = ~^ < i 77T7 < i 7T^V\ - "' 7 - 41 

s s/2 \w-f(c)\ \w-f(z)\ s 

which gives the desired contradiction. Therefore, the entire disk B s / 2 (/ (c)) belongs to / (U) , and 
hence the point / (c) belongs to the interior of the set / (U) . Since this holds for any point c € U, 
it follows that / (U) is open, as desired. 

Now we can give the version of the Inverse Function Theorem for complex variables. 

Theorem 7.11: 

Let S be a piecewise smooth geometric set, and suppose / : S — > C is continuously differentiable 
at a point c = a + bi, and assume that /' (c) ^ 0. Then: 

1. There exists an r > 0, such that B r (c) C S, for which / is one-to-one on B r (c) . 

2. / (c) belongs to the interior of / (5) . 

3. If g denotes the restriction of the function / to B r (c) , then g is one-to-one, g~ l is differentiable 
at the point / (c) , and g~ x (/ (c) = 1/ '/' (c) . 

Proof: 

Arguing by contradiction, suppose that / is not one-to-one on any disk B r (c) . Then, for each 
natural number n, there must exist two points z n = x„ + iy n and z n = x n + iy n such that 
\z n — c\ < l/n,\z n — c\ < 1/n, and / (z n ) = f (z n ) . If we write f = u + iv, then we would have that 
u (x n , y n ) — u (x' n , y' n ) = for all n. So, by part (c) of Exercise 4.35, there must exist for each n a 



point I x n , y n I , such that I x n , y n I is on the line segment joining z n and z n , and for which 

/ ' ' \ tialu (~ " \ . , N tialu (~ " \ , , N 

= u{x n ,y n ) -u{x n ,y n ) = -^-^ \x n ,y n \ [x n - x n ) + j—j- \x n ,y n j (y n - y n ) . (7.42) 

Similarly, applying the same kind of reasoning to v, there must exist points (x n , y n ) on the segment 
joining z n to z' n such that 

tialv , i , tialv , . n 

= j- -^ (x n , y n ) (Xn - x n ) + j-j- (x n ,y n ) (j/„ - y n ) . (7.43) 

If we define vectors JJn an d Vn by 

— > /tialu /" \ tialu (~ " \\ , 

u n = [Tr-r \x n ,y n ), -—- [x n ,y n )\ (7.44) 

\tialx \ J tialy \ / / 



and 



-> /tialv _ tialv _ , 

Vn= {ludx- {Xn ' yn) >UaTy {Xn > yn) ^ (7 - 45) 
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then we have that both {/„ and V n are perpendicular to the nonzero vector ((x n — x n ) , (y n — y' n )) . 
Therefore, U n an d V n are linearly dependent, whence 

//tialw //" \ tialu /" \ tialv . . tialw , _ „ .\\ 

Now, since both {x n + iy n } and {x n + iy n } converge to the point c = a + ib, and the partial 
derivatives of u and v are continuous at c, we deduce that 

//tialu , , tialw , , tialw , , tialw , ,\\ „ 

rfe H(^ (a ' 6) ^ (a ' 6) ^ (a ' 6) ^ (a ' 6) )) =0 - i7A7) 

Now, from Theorem 7.2, p. 187, this would imply that /' (c) = 0, and this is a contradiction. 
Hence, there must exist an r > for which / is one-to-one on B r (c) , and this proves part (1). 

Because / is one-to-one on B r (c) ,/ is obviously not a constant function. So, by the Open 
Mapping Theorem, the point / (c) belongs to the interior of the range of /, and this proves part 
(2). 

Now write g for the restriction of / to the disk B r (c) . Then g is one-to-one. According to part 
(2) of Theorem 4.2, p. 84, we can prove that <? _1 is differentiable at / (c) by showing that 

Um ,-(,)-»-■(/(.)) _ i 

*-/(«=) z -/(c) / (c) 

That is, we need to show that, given an e > 0, there exists a 5 > such that if < \z — f (c)\ < 5 
then 

\ 9- 1 (*)-9- 1 (f(c)) _J_, <e (?49) 

1 z-f(c) f(c) l<£ - U " 4yj 

First of all, because the function 1/w is continuous at the point /' (c) , there exists an e > such 
that if \w — /' (c) | < e , then 

I--t4tI< £ - ( 7 - 5 °) 

w j (c) 

Next, because / is differentiable at c, there exists a 8' > such that if < \y — c\ < d' then 

J/-C 

Now, by Theorem 3.11, p. 67, g _1 is continuous at the point /(c) , and therefore there exists a 
5 > such that if \z — f (c) | < 6 then 

\g- l {z)-g- l (f{c)\<5'.{l.S2) 
So, if |z - / (c) | < 5, then 

IS-'to-clH^M-^t/tc))!^'. (7.53) 

But then, 

J{g-Hz))-f{c) 



g 1 {z)- c 



f (c) | < e , (7.54) 



from which it follows that 

, 9-H*)-9-Hf{c)) 1 

1 z-f(c) /'(c) 

as desired. 
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< e, (7.55) 



7.7 Uniform Convergence of Analytic Functions 7 

Part (c) of Exercise 4.26 gives an example showing that the uniform limit of a sequence of differentiable 
functions of a real variable need not be differentiable. Indeed, when thinking about uniform convergence of 
functions, the fundamental result to remember is that the uniform limit of continuous functions is continuous 
(Theorem 3.18, The uniform limit of continuous functions is continuous., p. 76). The functions in Exer- 
cise 4.26 were differentiable functions of a real variable. The fact is that, for functions of a complex variable, 
things are as usual much more simple. The following theorem is yet another masterpiece of Weierstrass. 

Theorem 7.12: 

Suppose U is an open subset of C, and that {/„} is a sequence of analytic functions on U that con- 
verges uniformly to a function /. Then / is analytic on U. That is, the uniform limit of differentiable 
functions on an open set U in the complex plane is also differentiable on U. 
Proof: 

Though this theorem sounds impressive and perhaps unexpected, it is really just a combination of 
Theorem 6.10, p. 169 and the Cauchy Integral Formula. Indeed, let c be a point in U, and let r > 
be such that B r (c) C U. Then the sequence {/„} converges uniformly to / on the boundary C r of 
this closed disk. Moreover, for any z G B r (c) , the sequence {/„ (£) / {Q — z)} converges uniformly 
t° / (C) / (C ~~ z ) on Cr- Hence, by Theorem 6.10, p. 169, we have 

f(z) = Umfn (z) 

= i r r m dc 

2-KtJ C r Q — z ^ 

Hence, by part (a) of Theorem 7.7, Fundamental Theorem of Algebra, p. 195, / is expandable in 
a Taylor series around c, i.e., / is analytic on U. 



7.8 Isolated Singularities, and the Residue Theorem 8 

The first result we present in this section is a natural extension of Theorem 7.3, Cauchy's Theorem, Fun- 
damental Theorem of Analysis, p. 188. However, as we shall see, its consequences for computing contour 
integrals can hardly be overstated. 

Theorem 7.13: 

Let S be a piecewise smooth geometric set whose boundary Cs has finite length. Suppose c\, ..., c n 
are distinct points in the interior S° of 5, and that n, ...,r n are positive numbers such that the 
closed disks {B rk (cfc)} are contained in S° and pairwise disjoint. Suppose / is continuous on 
S \ UB rk (cfc) , i.e., at each point of S that is not in any of the open disks B Th (cfc) , and that / 



7 This content is available online at <http://cnx.Org/content/m36241/l.2/>. 
8 This content is available online at <http://cnx.Org/content/m36242/l.2/>. 
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is differentiable on S° \ U£> rfc (ck) , i.e., at each point of S° that is not in any of the closed disks 
B rk (cfc) • Write Ck for the circle that is the boundary of the closed disk B rk (cfc) . Then 

/ f(0d( = J2[ KOdC (7.57) 

•* Cs k—1 ^ k 

Proof: 

This is just a special case of part (d) of Exercise 7.3. 

Let / be continuous on the punctured disk B' r (c) , analytic at each point z in B r (c) , and suppose / 
is undefined at the central point c. Such points c are called isolated singularities of /, and we wish now to 
classify these kinds of points. Here is the first kind: 

Definition 7.1: 

A complex number c is called a removable singularity of an analytic function / if there exists an 
r > such that / is continuous on the punctured disk B' r (c) , analytic at each point in B r (c) , and 
lim z ^ c f (z) exists. 

Exercise 7.11 

a. Define / (z) = sinz/z for all z / 0. Show that is a removable singularity of /. 

b. For z / c, define / (z) = (1 — cos (z — cj) / (z — c) . Show that c is a removable singularity of 

/• 

c. For z y^ c, define / (z) = (1 — cos (z — c)) j{z — c) . Show that c is still a removable singularity 
of/. 

d. Let g be an analytic function on B r (c) , and set / (z) = (g (z) — g (c)) / (z — c) for all z e 
B r (c) . Show that c is a removable singularity of /. 

The following theorem provides a good explanation for the term "removable singularity." The idea is that 
this is not a "true" singularity; it's just that for some reason the natural definition of / at c has not yet been 
made. 

Theorem 7.14: 

Let / be continuous on the punctured disk B r (c) and differentiable at each point of the open 
punctured disk B r (c) , and assume that c is a removable singularity of /. Define / by / (z) = f (z) 
for all z S B r (c) , and / (c) = lim z ^ c f (z) . Then 

1. /is analytic on the entire open disk B r (c) , whence 

DC 

f{z) = Y J C k {z-c) k (7.58) 



,k 

Z k [Z - c 
k=0 



for all z g B r (c) . 
2. For any piecewise smooth geometric set S C B r (c) , whose boundary Cs has finite length, 
and for which c e S°, 

f(C)dC = 0. (7.59) 

C s 

Proof: 

As in part (a) of Exercise 7.7, define F on B r (c) by 

F(z)=^-f P^-dC. (7.60) 

Then, by that exercise, F is analytic on B r (c) . We show next that F (z) = f (z) on B r (c) , and 
this will complete the proof of part (1). 
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Let z be a point in B r (c) that is not equal to c, and let e > be given. Choose 6 > such that 
S < \z — c|/2 and such that \f (() — f (c) | < e if |C — c\ < S. Then, using part (c) of Exercise 7.5, 
we have that 



/(*) =/(*) 



2mJC r C-z Ul > 1-KiJC s C,-z "^ 



(7.61) 



>■ ' 27T2 J C5 C"- 2 27T2J Cg C — 2 

iT( z )_ 1 f /(0-/(c) rfC 

V / 2-7T2 J O5 C~ z 

where the last equality holds because the function / (c) / (Q — z) is an analytic function of £ on the 
disk B$ (c) , and hence the integral is by Theorem 7.3, Cauchy's Theorem, Fundamental Theorem 
of Analysis, p. 188. So, 

\f(z)-F(z)\ = |i f fiQzMdcl 



< if l/(0-/(c)| ds 

— 2-nJCn \C-z\ 



< 



2-KiJ C s _ C-z 
Cs \C-z\ 

hlcm** (7 - 62) 

2e. 

Since this holds for arbitrary e > 0, we see that f (z) = F (z) for all z ^= c in B r (c) . 
Finally, since 

/ (c) = Hm/ (z) = limz -» cF (z) = F (c) , (7.63) 

z — *c 

the equality of F and / on all of B r (c) is proved. This finishes the proof of part (1). 

Exercise 7.12 

Prove part (2) of the preceding theorem. 

Now, for the second kind of isolated singularity: 

Definition 7.2: 

A complex number c is called a pole of a function / if there exists an r > such that / is 
continuous on the punctured disk B' r (c) , analytic at each point of B r (c) , the point c is not a 
removable singularity of /, and there exists a positive integer k such that the analytic function 
(z — c) f (z) has a removable singularity at c. 

A pole c of / is said to be of order n, if n is the smallest positive integer for which the function 
/ (z) = (z — c) n f (z) has a removable singularity at c. 

Exercise 7.13 

a. Let c be a pole of order n of a function /, and write / (z) = (z — c) n f (z) . Show that / is 
analytic on some disk B r (c) . 

b. Define / (z) = sinz/z 3 for all z/0. Show that is a pole of order 2 of /. 

Theorem 7.15: 

Let / be continuous on a punctured disk B' r (c) , analytic at each point of B r (c) , and suppose 
that c is a pole of order n of /. Then 
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1. For all ze B' r (c) , 

CO 

/(*)= Y, «^- c ) fc - ( 7 - 64 ) 

k— — n 

2. The infinite series of part (1) converges uniformly on each compact subset K of B r (c) . 

3. For any piecewise smooth geometric set S C B r (c) , whose boundary Cs has finite length, 
and satisfying c & S°, 

f /(C)dC = 27r»o_i, (7.65) 

where A_i is the coefficient of (z — c)~ in the series of part (1). 

Proof: 

For each z € B r (c) , write / (z) = (z — c)" / (z) . Then, by Theorem 7.14, p. 202, / is analytic on 
B r (c) , whence 



/(*) 



(z-c) 



m 

(z-c)" 

hrT,Z^k(z-c) k (7.66) 



Y,T=-n a k{z-c) 



k 



where Ofe = c„ + fe. This proves part (1). 

We leave the proof of the uniform convergence of the series on each compact subset of B r (c) , 
i.e., the proof of part (2), to the exercises. 

Part (3) follows from Cauchy's Theorem (Theorem 7.3, Cauchy's Theorem, Fundamental The- 
orem of Analysis, p. 188) and the computations in Exercise 7.2. Thus: 

Ic s f(0dC = f c J(C)d( 

= I C ^k=-n a k{z-c) k dC 
= T,T=-n a kJ Cr (C- c ) kd C 

= a_i2iri, 

as desired. The summation sign comes out of the integral because of the uniform convergence of 
the series on the compact circle C r . 

Exercise 7.14 

a. Complete the proof to part (2) of the preceding theorem. That is, show that the infinite 
series Yl'kL-n a k{ z ~ c ) converges uniformly on each compact subset K of B r (c) . HINT: 
Use the fact that the Taylor series J2^Lo Cn ( z ~ c )™ f° r / conver g es uniformly on the entire 
disk B r (c) , and that if c is not in a compact subset K of B r (c) , then there exists a 6 > 
such that \z — c| > 5 for all z s K. 

b. Let /, c, and / be as in the preceding proof. Show that 

°- 1 = t^tt (7 - 68) 

c. Suppose g is a function defined on a punctured disk B r (c) that is given by the formula 

CO 

g(z)= ]T a k (z-cf (7.69) 

fc— — n 
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for some positive integer n and for all z € B r (c) . Suppose in addition that the coefficient 
d-n 7^ 0. Show that c is a pole of order n of g. 

Having defined two kinds of isolated singularities of a function /, the removable ones and the polls of finite 
order, there remain all the others, which we collect into a third type. 

Definition 7.3: 

Let / be continuous on a punctured disk B' r (c) , and analytic at each point of B r (c) . The point 
c is called an essential singularity of / if it is neither a removable singularity nor a poll of any 
finite order. Singularities that are either poles or essential singularities are called nonremovable 
singularities. 

Exercise 7.15 

For z/0, define / (z) = e 1 / 2 . Show that is an essential singularity of /. 

Theorem 7.16: 

Let / be continuous on a punctured disk B' r (c) , analytic at each point of B r (c) , and suppose 
that c is an essential singularity of /. Then 

1. For all z e B' r (c) , 

oo 

f(z)= Y, Mz-cf, (7-70) 

k — — oo 

where the sequence {afc}!^ has the property that for any negative integer N there is a 
k < N such that a^ ^ 0. 

2. The infinite series in part (1) converges uniformly on each compact subset K of B r (c) . That 
is, if F n is defined by F n (z) = Y^k=-n a k{z — c) , then the sequence {F n } converges uniformly 
to / on the compact set K. 

3. For any piecewise smooth geometric set S C B r (c) , whose boundary C$ has finite length, 
and satisfying c € 5°, we have 



f /(C)dC = 27r»o_i, (7.71) 

J C s 



where o_i is the coefficient of (z — c) in the series of part (1). 



Proof: 

Define numbers {ak} ?^ as follows. 



— SS/*,^* ( " 2) 



Note that for any < 5 < r we have from Cauchy's Theorem that 



"4/ c ,^t <" 3 > 



where Cs denotes the boundary of the disk B$ (c) . 
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Let z / cbe in B r (c) , and choose 6 > such that 5 < \z—c\. Then, using part (c) of Exercise 7.5, 
and then mimicking the proof of Theorem 7.5, p. 192, we have 

flz) = ^f r l^d(-^f r lMdc 

J ^ ' 2-kiJ C r C, — z 27riJ Cg Q—z ^ 

- J_ f 110 AC + J- f t(0 AC 

— 1-Kii C r (C-C)-(Z-C) W S ^ 2-KiJ C S (Z-C)-(C-C) "^ 

- J_f ZK) !_,//■ +J_f ZK) ^wr 

f(Q -frt^^^v 00 _i f f (f\(r — Ai ''Art* _ ^-J- ] 



2irtJ C r C-c ^fc=0 VC-c/ ^ ^ 2-KiJ C s z-c <^j=0 \z-cj al » (7 741 



5Xo & J Cr (c^Fft d«* - C ) fc + Ef=o ak/c./ (0 (C - c) 3 d«z - c)~ 

J2T= Mz - c) k + Efei-oo ^ilc s (C-1P+ 1 ^ " c ) fc 
Er=o <**(* - c ) fc + Efc=-oo 2¥i/c„ (C-S+ 1 d ^ z " c ) fc 



(C-c)* 

fe= 



Eoo / \ k 

k=-oo a k{Z-c) , 



which proves part (1). 

We leave the proofs of parts (2) and (3) to the exercises. 

Exercise 7.16 

a. Justify bringing the summation signs out of the integrals in the calculation in the preceding 
proof. 

b. Prove parts (2) and (3) of the preceding theorem. Compare this with Exercise 7.14. 

7.3: 

REMARK The representation of / (z) in the punctured disk B r (c) given in part (1) of The- 
orem 7.15, p. 203 and Theorem 7.16, p. 205 is called the Laurent expansion of / around the 
singularity c. Of course it differs from a Taylor series representation of /, as this one contains neg- 
ative powers of z — c. In fact, which negative powers it contains indicates what kind of singularity 
the point c is. 

Non removable isolated singularities of a function / share the property that the integral of / around a 
disk centered at the singularity equals 2nia-i, where the number o_i is the coefficient of (z — c)~ in the 
Laurent expansion of / around c. This number 27ria_i is obviously significant, and we call it the residue of 
f at c, and denote it by Rf (c) . 

Combining Theorem 7.13, p. 201, Theorem 7.15, p. 203, and Theorem 7.16, p. 205, we obtain: 

Theorem 7.17: Residue Theorem 

Let S be a piecewise smooth geometric set whose boundary has finite length, let ci, ..., c n be points 
in 5°, and suppose / is a complex- valued function that is continuous at every point z in S except 
the Cfe's, and differentiable at every point z € S° except at the c^'s. Assume finally that each c^ is 
a nonremovable isolated singularity of /. Then 



/n 
/(c)dc = E%( c *)- ( 7 - 75 ) 



fe=l 
That is, the contour integral around Cs is just the sum of the residues inside S. 

Exercise 7.17 

Prove Theorem 7.17, Residue Theorem, p. 206. 

Exercise 7.18 

Use the Residue Theorem to compute /„ / (Q) dC, for the functions / and geometric sets S given 
below. That is, determine the poles of / inside S, their orders, the corresponding residues, and 
then evaluate the integrals. 
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a. / (z) = sin (3z) /z 2 , and S = B~i (0) . 

b. f{z) = e^ z ,andS = B 1 {0). 

c. / ( z ) = e 1 ^ 2 , and S = B 1 (0). _ 

d. / (z) = (1/z (z - 1)) , and S = B 2 (0) . 
/ (z) = ((1 - z 2 ) jz (1 + z 2 ) (2z + l) 2 ) , and S = B 2 (0) 



o. 



f. / (z) = 1/ (l + z 4 ) = (1/ (z 2 - i) (z 2 + i)) , and S = B r (0) for any r > 1. 

The Residue Theorem, a result about contour integrals of functions of a complex variable, can often provide 
a tool for evaluating integrals of functions of a real variable. 

Example 7.1 

Consider the integral 



dx. (7.76) 



1 . 

Let us use the Residue Theorem to compute this integral. 
Of course what we need to compute is 

f B 1 

lira I j dx. (7.77) 

B^ooJ_ B l + x A v ; 

The first thing we do is to replace the real variable a; by a complex variable Z, and observe that the 
function / (z) = 1/ (l + z 4 ) is analytic everywhere except at the four points ±e l7r / 4 and ±e 3t7r / 4 . 
See part (f) of the preceding exercise. These are the four points whose fourth power is — 1, and 
hence are the poles of the function /. 

Next, given a positive number B, we consider the geometric set (rectangle) Sb that is determined 
by the interval [—B,B] and the two bounding functions I (x) = and u(x) = B. Then, as long 
as B > 1, we know that / is analytic everywhere in 5° except at the two points c\ = e OT / 4 and 
c-i = e 3l7T / 4 , so that the contour integral of / around the boundary of Sb is given by 

d( = R f (ci) + R f (c 2 ) • (7.78) 



- i + C 

Now, this contour integral consists of four parts, the line integrals along the bottom, the two sides, 
and the top. The magic here is that the integrals along the sides, and the integral along the top, 
all tend to as B tends to infinity, so that the integral along the bottom, which after all is what 
we originally were interested in, is in the limit just the sum of the residues inside the geometric set. 

Exercise 7.19 

Verify the details of the preceding example. 

a. Show that 

C B 1 

Urn / j dt = 0. (7.79) 

B^J i + (B + itf 



b. Verify that 



c. Show that 



B 



B 



Urn j j dt = 0. (7.80) 

' b l + {t + iB) 4 



j_^— dx = ^2. (7.81) 



20 „ CHAPTER 7. THE FUNDAMENTAL THEOREM OF ALGEBRA, AND THE 

FUNDAMENTAL THEOREM OF ANALYSIS 

Methods similar to that employed in the previous example and exercise often suffice to compute integrals of 
real- valued functions. However, the method may have to be varied. For instance, sometimes the appropriate 
geometric set is a rectangle below the x-axis instead of above it, sometimes it should be a semicircle instead 
of a rectangle, etc. Indeed, the choice of contour (geometric set) can be quite subtle. The following exercise 
may shed some light. 

Exercise 7.20 



a. Compute 



oo ix 



and 



oo — ix 



- dx (7.82) 

dx. (7.83) 

J-oo 1 T x- 

b. Compute 

sin(-x) 

' dx (7.84) 



-OO 

and 



l + x 

/OO 
^—^dx. (7.85) 

Example 7.2 

An historically famous integral in analysis is J_ sinx/x dx. The techniques described above don't 
immediately apply to this function, for, even replacing the x by a z, this function has no poles, so 
that the Residue Theorem wouldn't seem to be much help. Though the point is a singularity, it 
is a removable one, so that this function sinz/z is essentially analytic everywhere in the complex 
plane. However, even in a case like this we can obtain information about integrals of real-valued 
functions from theorems about integrals of complex- valued functions. 

Notice first that /_ sinx/x dx is the imaginary part of /_ e lx /xdx, so that we may as well 
evaluate the integral of this function. Let / be the function defined by / (z) = e tz /z, and note that 
is a pole of order 1 of /, and that the residue Rf (0) = 2iri. Now, for each B > and 5 > define 
a geometric set Sb,s, determined by the interval [-B, B] , as follows: The upper bounding function 
ub,s is given by ub,5 (x) = B, and the lower bounding function Ib,s is given by Ib,s (x) = for 
—B < x < —S and 5 < x < B, and Ib,s (x) = 6e l ' KX l & for —S < x < 5. That is, Sb.s is just like the 
rectangle Sb in Example 1 above, except that the lower boundary is not a straight line. Rather, 
the lower boundary is a straight line from —B to —5, a semicircle below the x-axis of radius 5 from 
—5 to 5, and a straight line again from 5 to B. 

By the Residue Theorem, the contour integral 

/(C)dC = -R/(0) = 27r». (7.86) 

c s B ,s 

As in the previous example, the contour integrals along the two sides and across the top of Sb,s 
tend to as B tends to infinity. Finally, according to part (e) of Exercise 6.15, the contour integral 
of / along the semicircle in the lower boundary is iri independent of the value of 5. So, 



Urn lira I — dC, = ffi, (7.87) 

B^oo5->0j graph( ; B _,) C 

implying then that 

POO 

t ^^nx 

— dx = ir. (7.88) 



f 

■J —: 
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Exercise 7.21 

a. Justify the steps in the preceding example. In particular, verify that 

t-B e i(B+it) 

Urn / — dt = 0, (7.89) 

B^ceJ B + it y J 

,-B e i(t+iB) 

Urn / — dt = 0, (7.90) 

and 

r P H 

-d( = TTi, (7.91) 



C 6 



C 



where C$ is the semicircle of radius 5, centered at the origin and lying below the x-axis. 
b. Evaluate 



oo • 2 

sin x 



dx. (7.92) 
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Chapter 8 

Appendix: Existence and Uniqueness of 
a Complete Ordered Field 1 



This appendix is devoted to the proofs of Theorem 1.1, p. 13 and Theorem 1.2, p. 13, which together 
assert that there exists a unique complete ordered field. Our construction of this field will follow the ideas 
of Dedekind, which he presented in the late 1800's. 

Definition 8.1: 

By a Dedekind cut, or simply a cut, we will mean a pair (A, B) of nonempty (not necessarily 
disjoint) subsets of the set Q of rational numbers for which the following two conditions hold. 

1. A U B = Q. That is, every rational number is in one or the other of these two sets. 

2. For every element a € A and every element b s B,A < b. That is, every element of A is less 
than or equal to every element of B. 

Recall that when we define the rational numbers as quotients (ordered pairs) of integers, we faced the 
problem that two different quotients determine the same rational number, e.g., 2/3 = 6/9. There is a similar 
equivalence among Dedekind cuts. 

Definition 8.2: 

Two Dedekind cuts (Ai,bi) and (^2,-82) are called equivalent if 01 < 62 for all <x\ € A\ and all 
i>2 € B2, and a 2 < b\ for all a 2 € A 2 and all b\ e B\. In such a case, we write (Ai, B\) = (A 2 , B 2 ) ■ 
Exercise 8.1 

a. Show that every rational number r determines three distinct Dedekind cuts that are mutually 
equivalent . 

b. Let B be the set of all positive rational numbers r whose square is greater than 2, and let 
A comprise all the rationals not in B. Prove that the pair (A, B) is a Dedekind cut. Do you 
think this cut is not equivalent to any cut determined by a rational number r as in part (a)? 
Can you prove this? 

c. Prove that the definition of equivalence given above satisfies the three conditions of an equiv- 
alence relation. Namely, show that 

i. (Reflexivity) (A, B) is equivalent to itself. 

ii. (Symmetry) If (A t , B x ) = {A 2 ,B 2 ) , then (A 2 , B 2 ) = (A U B{) . 
hi. (Transitivity) If (A^B^ = {A 2 ,B 2 ) and {A 2 ,B 2 ) = {A 3 ,B 3 ) , then (A u B{) = {A 3 , B 3 ) . 



1 This content is available online at <http://cnx.Org/content/m36243/l.2/>. 
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There are three relatively simple-sounding and believable properties of cuts, and we present them in the next 
theorem. It may be surprising that the proof seems to be more difficult than might have been expected. 

Theorem 8.1: 

Let (A, B) be a Dedekind cut. Then 

1. If a G A and a < a, then a G A. 

2. If b G B and b' > b, then b' G B. 

3. Let e be a positive rational number. Then there exists an a G A and a b G B such that 

b — a < e. 

Proof: 

Suppose a is an element of A, and let a < a be given. By way of contradiction suppose that a 
does not belong to A. Then, by Condition (1) of the definition of a cut, it must be that a G B. 
But then, by Condition (2) of the definition of a cut, we must have that a < a , and this is a 
contradiction, because a < a. This proves part (1). Part (2) is proved in a similar manner. 

To prove part (3), let the rational number e > be given, and set r = e/2. Choose an element 
ao e A and an element bo G B. Such elements exist, because A and B are nonempty sets. Choose a 
natural number N such that ao + Nr > bo. Such a natural number TV must exist. For instance, just 
choose N to be larger than the rational number (bo — ao) l r - Now define a sequence {a,k} of rational 
numbers by a^ = ao + kr, and let K be the first natural number for which ajf G B. Obviously, such 
a number exists, and in fact K must be less than or equal to N. Now, a^-i ls not in B, so it must 
be in A. Set a = Ak-\ and b = Ak- Clearly, a G A,b G B, and 

b — a = (ik — <xk-\ = do + Kr — ao — (K — 1) r = r = - < e, (8-1) 

and this proves part (3). 

We will make a complete ordered field F whose elements are the set of equivalence classes of Dedekind 
cuts. We will call this field the Dedekind field. To make this construction, we must define addition and 
multiplication of equivalence classes of cuts, and verify the six required field axioms. Then, we must define 
the set P that is to be the positive elements of the Dedekind field F, and then verify the required properties of 
an ordered field. Finally, we must prove that this field is a complete ordered field; i.e., that every nonempty 
set that is bounded above has a least upper bound. First things first. 

Definition 8.3: 

If (Ai,Bi) and (^.2,-82) are Dedekind cuts, define the sum of (Ai,B{) and (A 2 ,B 2 ) to be the cut 
(A3, B3) described as follows: B3 is the set of all rational numbers 63 that can be written as b\ + 62 
for some b\ G B\ and 62 G B2, and A3 is the set of all rational numbers r such that r < 63 for all 
03 G B 3 . 
Several things need to be checked. First of all, the pair (A3, B3) is again a Dedekind cut. Indeed, it is clear 
from the definition that every element of A3 is less than or equal to every element of B3, so that Condition 
(2) is satisfied. To see that Condition (1) holds, let r be a rational number, and suppose that it is not in 
A 3 . We must show that r belongs to B 3 . Now, since r ^ A3, there must exist an element 63 = 61 + b 2 G B 3 
for which r > 63. Otherwise, r would be in A3. But this means that r — 62 > b\, and so by part (2) of 
Theorem 8.1, p. 212, we have that r — 62 is an element b\ of B\. Therefore, r = b\ + 62, implying that r G B3, 
as desired. 

We define the cut to be the pair Ao = {r : r < 0} and Bq = {r : r > 0}. This cut is one of the three 
determined by the rational number 0. 

Exercise 8.2 

a. Prove that addition of Dedekind cuts is commutative and associative. 

b. Prove that if (A 1 ,B l ) = (Ci.Di) and (A 2 ,B 2 ) = (C 2 ,D 2 ), then (Ai.fli) + (A 2 ,B 2 ) = 
(Ci,Di) + (C 2 ,I>2). 
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c. Find an example of a cut (A, B) such that (A, B) + =/= (A, B) . 

d. Prove that {A, B) + 0= {A, B) for every cut (A, B) . 

We define addition in the set F of all equivalence classes of Dedekind cuts as follows: 

Definition 8.4: 

If x is the equivalence class of a cut (A, b) and y is the equivalence class of a cut (C, D) , then x + y 
is the equivalence class of the cut (^4, B) + (C, D) . 

It follows from the previous exercise, that addition in F is well-defined, commutative, and associative. 
We are on our way. 

We define the element of F to be the equivalence class of the cut. The next theorem establishes one 
of the important field axioms for F, namely, the existence of an additive inverse for each element of F. 

Theorem 8.2: 

If (A, B) is a Dedekind cut, then there exists a cut (A' , B) such that (A, B) + (A', B) is equivalent 
to the cut. Therefore, if x is an element of F, then there exists an element y of F such that x+y = 0. 
Proof: 

Let A' = —B, i.e., the set of all the negatives of the elements of B, and let B' = —A, i.e., the set of 
all the negatives of the elements of A. It is immediate that the pair [A , B) is a Dedekind cut. Let 
us show that (A, B) + (A' , B') is equivalent to the zero cut. Let (C, D) = (A, B) + (A' , B') . Then, 
by the definition of the sum of two cuts, we know that D consists of all the elements of the form 
d=b+b' = b— a, where b £ B and a e A. Since a < b for all a s A and b s B, we see then that 
the elements of D are all greater than or equal to 0. To see that (C, D) is equivalent to the cut, 
it will suffice to show that D contains all the positive rational numbers. (Why?) Hence, let e > 
be given, and choose an a s A and a b s B such that b — a < e. This can be done by Condition (3) 
of Theorem 8.1, p. 212. Then, the number b — a € D, and hence, by part (2) of Theorem 8.1, p. 
212, e e D. It follows then that the cut (C, D) is equivalent to the zero cut (Ao, Bq) , as desired. 

We will write — (A, B) for the cut (A , B) of the preceding proof. 
Exercise 8.3 

a. Suppose (A, B) is a cut, and let (C, D) be a cut for which (A, B) + (C, D) is equivalent to the 
cut. Show that (C, D) = (A , B') = - {A, B) . 

b. Prove that the additive inverse of an element x of the Dedekind field F is unique. 

The definition of multiplication of cuts, as well as multiplication in F, is a bit more tricky. In fact, we will 
first introduce the notion of positivity among Dedekind cuts. 

Definition 8.5: 

A Dedekind cut x = (A, B) is called positive if A contains at least one positive rational number. 

Exercise 8.4 

a. Suppose (A,B) and (C,D) are equivalent cuts, and assume that (A,B) is positive. Prove 
that (C, D) also is positive. Make the obvious definition of positivity in the set F. 

b. Show that the sum of two positive cuts is positive. Conclude that the sum of two positive 
elements of F, i.e., the sum of two equivalence classes of positive cuts, is positive. 

c. Let (A, B) be a Dedekind cut. Show that one and only one of the following three properties 
holds for (A, B) . (i) (A, B) is a positive cut, (ii) — (A, B) is a positive cut, or (iii) (A, B) is 
equivalent to the cut. 

d. Establish the law of tricotomy for F : That is, show that one and only one of the following 
three properties holds for an element x € F. (i) x is positive, (ii) —x is positive, or (iii) x = 0. 
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We first define multiplication of cuts when one of them is positive. 

Definition 8.6: 

Let (A\, Bi) and (A2, B 2 ) be two Dedekind cuts, and suppose that one of these cuts is a positive 
cut. We define the product ( A3, B 3) of (A\,B\) and (A 2 ,B 2 ) as follows: Set B3 equal to the set of 
all 63 that can be written as 6162 for some b\ € B\ and 62 £ B 2 . Then set A3 to be all the rational 
numbers r for which r < b 3 for all 63 G B 3 . 

Again, things need to be checked. 

Exercise 8.5 

a. Show that the pair (A3, B3) of the preceding definition for the product of positive cuts is in 
fact a Dedekind cut. 

b. Prove that multiplication of Dedekind cuts, when one of them is positive, is commutative. 

c. Suppose (Ai,Bi) is a positive cut. Prove that 

(A^Bt) ((A 2 ,B 2 ) + (A 3 , B 3 )) = (A u B t ) (A 2 ,B 2 ) + (A^B,) (A 3 ,B 3 ) (8.2) 

for any cuts (^2,-82) and (^3, £3) . 

d. Show that, if (Ai,B x ) = (A 2 ,B 2 ) and (Ci,£>i) = (C 2 ,D 2 ) and (a\,B{) and (A 2 ,B 2 ) are 
positive cuts, then (^1,^) (Ci,£>i) = (A 2 , B 2 ) (C 2 , D 2 ) . 

e. Show that the product of two positive cuts is again a positive cut. 

We are ready to define multiplication in F. 

Definition 8.7: 

Let x and y be elements of F. 

If either x or y is positive, define the product x x y to be the equivalence class of the cut 
(A B) (C, D) , where x is the equivalence class of (A, B) and y is the equivalence class of (C, D) . 

If either x or y is 0, define x x y to be 0. 

If both x and y are negative, i.e., both —x and — y are positive, define x x y = (— x) x (— y) . 

The next exercise is tedious. It amounts to checking a bunch of cases. 
Exercise 8.6 

a. Prove that multiplication in F is commutative. 

b. Prove that multiplication in F is associative. 

c. Prove that multiplication in F is distributive over addition. 

d. Prove that the product of two positive elements of F is again positive. 

We define the element 1 of F to be the equivalence class of the cut (A 1 ^ 1 ) , where A 1 = {r : r < 1} and 
B 1 = {r : r > 1}. 

Exercise 8.7 

a. Prove that the elements and 1 of F are not equal. 

b. Prove that x x 1 = x for every element x € F. 

c. Use the associative law and part (b) to prove that if xy = 1 and xz = 1, then y = z. 

Theorem 8.3: 

With respect to the operations of addition and multiplication defined above, together with the 
definition of positive elements, F is an ordered field. 
Proof: 

The first five axioms for a field, given in Section 1.2, have been established for F in the preceding 
exercises, so that we need only verify axiom 6 to complete the proof that F is a field. Thus, let 
x e F be a nonzero element. We must show the existence of an element y of F for which x x y = 1. 
Suppose first that x is a positive element of F. Then x is the equivalence class of a positive cut 
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(A, B) , and therefore A contains some positive rational numbers. Let ao be a positive number that 
is contained in A. It follows then that every element of B is greater than or equal to ao and hence 

is positive. Define B to be the set of all rational numbers r for which r > 1/6 for every b s B. Then 

define A to be the set of all rationals r for which r <b for every b&B ■ It follows directly that the 

pair I A, B I is a Dedekind cut. 



Let (C, D) = {A, B) x I A, B I , and note that every element d € D is of the form d = b b, and 

hence is greater than or equal to 1. We claim that (C,D) is equivalent to the cut (A 1 ,^ 1 ) that 
determines the element 1 of F. To see this we must verify that D contains every rational number 
r that is greater than 1. Thus, let r > 1 be given, and set e = ao (r — 1) . From Condition (3) of 
Theorem 8.1, p. 212, choose an a € A and a b' € B such that b' — a < e. Without loss of generality, 

we may assume that a > ao- Finally, set b= 1/a . Clearly b> 1/b for all b e B, so that b£B ■ Also 

d = b b& D, and 



d = b b= — = ; <1 + — <1+ — = r, (8.3) 

a a a ao 

implying that r s D. Therefore, {C,D) is equivalent to the cut (A 1 ^ 1 ) , implying that (A, B) x 
A, B ) is equivalent to the cut (A 1 , B 1 ) . Therefore, if y is the element of F that is the equivalence 



class of the cut I A, B I , then x x y = 1, as desired. 

If x is negative, then —a; is positive. If we write z for the multiplicative inverse of the positive 
element —x, then — z is the multiplicative inverse of the element x. Indeed, by the definition of the 
product of two negative elements of F,x x (—z) = (—x) x z = 1. 

The properties that guarantee that F is an ordered field also have been established in the 
preceding exercises, so that the proof of this theorem is complete. 

So, the Dedekind field is an ordered field, but we have left to prove that it is complete. This means we 
must examine upper bounds of sets, and that requires us to understand when one cut is less than another 
one. We say that a cut (A, B) is less than or equal to a cut C, D if a < d for every a s A and d e D. We say 
that an element x in the ordered field F is less than or equal to an element y if y — x is either positive or 0. 

Theorem 8.4: 

Let x and y be elements of F, and suppose x is the equivalence class of the cut (A, B () and y is 
the equivalence class of the cut (C, D) . Then x < y if and only if (A, B) < (C, D) . 
Proof: 

We have that x < y if and only if the element y — x = y H x is positive or 0. Writing, 

as before, (A',B) for the cut — (A,B) , we have that y — x is the equivalence class of the cut 
(C,d)-(A,B) = (C,D) + (A',B') , so we need to determine when the cut (G,H) = (C,D) + (A' ,B') 
is a positive cut or the cut; which is the case when the set H only contains nonnegative numbers. 
By definition of addition, the set H contains all numbers of the form h = d + b' for some d € D 
and some b € B ' . Since B' = —A, this means that H consists of all elements of the form h = d — a 
for some d € D and a e A. Now these numbers h are all greater than or equal to if and only if 
each a£i4is less than or equal to each d € D, i.e., if and only if (A, B) < (C, D) . This proves the 
theorem. 
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We are now ready to present the first of the two main theorems of this appendix, that is Theorem 1.1, 
p. 13 in Section 1.4. 

Theorem 8.5: 

There exists a complete ordered field. Indeed, the Dedekind field F is a complete ordered field. 
Proof: 

Let S be a nonempty subset of F, and suppose that there exists an upper bound for S; i.e., an 
element M of F such that x < M for all x € S. Write (A, B) for a cut such that M is the equivalence 
class of (A, B) . We must show that there exists a least upper bound for S. 

For each x G S, let (A x , B x ) be a Dedekind cut for which x is the equivalence class of (A x , B x ) , 
and note that a x < b for all a x G A x and all b G B. Let Aq be the union of all the sets A x for x G S. 
Let B be the set of all rational numbers r for which r > ao for every a G A . we claim first that the 
pair (Aq,Bq) is a Dedekind cut. Both sets are nonempty; Ao because it is the union of nonempty 
sets, and Bo because it contains all the elements of the nonempty set B. Clearly Condition (2) for 
a cut holds from the very definition of this pair. To see Condition (1), let r be a rational number 
that is not in Bq. We must show that it is in Aq. Now, since r is not in Bq, there must exist some 
a G A for which r < a . But ao G U xe sA x , so that there must exist an x G S such that a G A x , 
and hence r is also in A x . But then r G Aq, and this proves that (Aq, B ) is a Dedekind cut. 

Let Mq be the equivalence class determined by the cut (Aq,Bq) . Since each A x C Aq, we see 
that a x < &o f° r every a x G A x and every &o G -Bo- Hence, (A x , B x ) < (Aq, Bq) for every x G S, and 
therefore, by Theorem A. 4, x < Mo for all x e S. This shows that Mq is an upper bound for S. 

Finally, suppose M' is another upper bound for S, and let (A,B) be a cut for which M' is 
the equivalence class of (A,B) . Then a x < b' for every a x G A x and every b' G B ' , implying 
that a < b' for every a G A and every b' G B '. Therefore, (A ,B ) < (A,B) , implying that 
Mq < M' . This shows that M is the least upper bound for S, and the theorem is proved. 

We come now to the second major theorem of this appendix, i.e., Theorem 1.2, p. 13 of Section 1.4. This 
one asserts the uniqueness, up to isomorphism, of complete ordered fields. 

Theorem 8.6: 

Let fbea complete ordered field. Then there exists an isomorphism of F onto the Dedekind field 
F. That is, there exists a one-to-one function J :F^ F that is onto all of F, and that satisfies 

1. J(x + y) = J(x) + J(y). 

2. J (xy) = J (x) J (y) . 

3. If x > 0, then J (x) > 0. 

Proof: 

We know from Section 1.1 that, inside any ordered field, there is a subset that is isomorphic to the 

field Q of rational numbers. We will therefore identify this special subset of F with Q. 

If x is an element of F, let A x = {r G Q : r < x} and let B x = {r G Q : r > x}. We claim first 
that the pair (A X ,B X ) is a Dedekind cut. Indeed, from the definition of A x and B x , we see that 
Condition (2), i.e., that each a x G A x is less than or equal to each b x G B x , holds. To see that 

Condition (1) also holds, let r be a rational number in F ■ Then, because F is an ordered field, 
either r < x or r > x, i.e., r G A x or r G B x . Hence, (A x , B x ) is a Dedekind cut. 

We define a function J from F into F by setting J (x) equal to the equivalence class determined 
by the cut (A x , B x ) . We must check several things. 

First of all, J is one-to-one. Indeed, let x and y be elements of F that are not equal. Assume 
without loss of generality that x < y. Then, according to , which is a theorem about complete 
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ordered fields and hence applicable to F„ there exist two rational numbers r\ and r-i such that 
x < r\ < r2 < y, which implies that n G B x and T2 G A y . Since r-i > n, the cut (A y ,B y ) is not 
equivalent to the cut (A x , S x ) , and therefore J (a;) ^ J (y) . 

Next, we claim that the function J is onto all of the Dedekind field F. Indeed, let z be an element 
of F, and let (A, B) be a Dedekind cut for which z is the equivalence class determined by (A, B) . 

Think of A as a subset of the complete ordered field F ■ Then A is nonempty and is bounded above. 
In fact, every element of B is an upper bound of A. Let x = supA. (Here is another place where 

we are using the completeness of the field F •) We claim that the cut (A, B) is equivalent to the 
cut (A x , B x ) , which will imply that J (x) = z. Thus, if a x G A x , then a x < x, and x < b for every 
b G B, because x is the least upper bound of A. Similarly, if a G A, then a < x, and x < b x for 
every b x G B x . This proves that the cuts (A,B) and (A X ,B X ) are equivalent, as desired. 

If x and y are elements of F, and b x G B x and b y G B y , then b x > x and b y > y, so that 
b x + b y > cc + y, and therefore b x + b y G B x+y for every 6^ € -Ba; and 6j, G B y . On the other 
hand, if r G B x+y , then r > x + «/. Therefore, r — x > y, implying, again by Theorem 1.8, p. 
18, that there exists an element b y e B y such that y < b y < r — x. But then r — b y > x, which 
means that r — b y = b x for some b x G B x . So, r = b x + b y , and this shows that B x+y = b x + B y . 
It follows from this that the cuts (A x+y , B x+y ) and (A x ,B x ) + (A y ,B y ) are equal, and therefore 

J (x + y) = J (x) + J (y) . A consequence of this is that J (— x) = — J (x) for all x G_F . 

If x and y are two positive elements of F, then an argument just like the one in the preceding 
paragraph shows that J (xy) = J (x) J (y) . Then, since J(— x) = — J(x) , the fact that J (xy) = 

J (x) J (y) for all x, y G_F follows. 

Finally, if x is a positive element of F, then the set A x must contain some positive rationals, 
and hence the cut (A x , B x ) is a positive cut, implying that J (x) > 0. 

We have verified all the requirements for an isomorphism between the two fields F and F, and 
the theorem is proved. 



218 GLOSSARY 



Glossary 



A bounded real- valued function / on a closed bounded interval [a, b] is called Riemann-integrable 
if, given any e > 0, there exist step functions k and I, on [a, b] for which k (x) < f (x) < I (x) for 
all x, such that J (I — k) < e. We denote the set of all functions on [a, 6] that are 
Riemann-integrable by Ir ([a, b\) . 

A complex number c is called a pole of a function / if there exists an r > such that / is 
continuous on the punctured disk B' r (c) , analytic at each point of B r (c) , the point c is not a 
removable singularity of /, and there exists a positive integer k such that the analytic function 
(z — c) / (z) has a removable singularity at c. 

A pole c of / is said to be of order n, if n is the smallest positive integer for which the function 
/ (z) = (z — c) n f (z) has a removable singularity at c. 

A complex number c is called a removable singularity of an analytic function / if there exists an 
r > such that / is continuous on the punctured disk B' r (c) , analytic at each point in B r (c) , 

and lim z ^ c f (z) exists. 

A Dedekind cut x = (A, B) is called positive if A contains at least one positive rational number. 

A differential form on a subset U of R 2 is denoted by u> = Pdx + Qdy, and is determined by two 
continuous real- valued functions P and Q on U. We say that u> is bounded or uniformly 
continuous if the functions P and Q are bounded or uniformly continuous functions on U. We 
say that the differential form ui is smooth of order k if the set U is open, and the functions P 
and Q have continuous mixed partial derivatives of order k. 

If ui = Pdx + Qdy is a differential form on a set U, and if C is any piecewise smooth curve of 
finite length contained in U, then we define the line integral J „u) of uj over C by 

f ui= f Pdx + Qdy= f P ( 7 (t)) x (t) + Q (7 (*)) V (t) eft, (6.48) 

J c J c Jo 

where 7 (t) = (x (t) , y (t)) is a parameterization of C by arc length. 

A field F is called an ordered field if there exists a subset P C F that satisfies the following two 
properties: 

• If x, y € P, then x + y and xy are in P. 

• If x € F, then one and only one of the following three statements is true. 

• xe P, 

■ —x € P, and 

• x = 0. (This property is known as the law of tricotomy.) 
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A field is a nonempty set F on which there are defined two binary operations, addition (+) and 
multiplication (x), such that the following six axioms hold: 

• Both addition and multiplication are commutative and associative. 

• Multiplication is distributive over addition; i.e., 

xx(y+z) = xxy + xxz (1-6) 

for all x,y, z G F. 

• There exists an element in F, which we will denote by 0, that is an identity for addition; i.e., 
x + = x for all x € F. 

• There exists a nonzero element in F, which we will denote by 1, that is an identity for 
multiplication; i.e., x x 1 = x for all x € F. 

• If x G F, then there exists a unique element y e F such that x + y = 0. This element y is 
called the additive inverse of x and is denoted by —a;. 

• If x € F and x ^ 0, then there exists a unique element y e F such that x x y = 1. This 
element y is called the multiplicative inverse of x and is denoted by x~ l . 

A function / whose domain S equals — S, is called an even function if / (—z) = f (z) for all z in 
its domain. It is called an odd function if / (—z) = —f (z) for all z in its domain. 

A function / : S — > C is called uniformly continuous on S if for each positive number e, there 
exists a positive number 5 such that \f (x) — f (y)\ < e for all x,y € S satisfying \x — y\ < 5. 

A nonzero polynomial or polynomial function is a complex- valued function of a complex 
variable, p : C — > C, that is defined by a formula of the form 

n 

P ( z ) = Z-/ akzk = a o + a i z + a 2^ 2 + ... + a n z n , (3-11) 

fe=0 

where the a^'s are complex numbers and a n ^ 0. The integer n is called the degree of the 
polynomial p and is denoted by deg (p) . The numbers Oo, Oi, ..., a n are called the coefficients of 
the polynomial. The domain of a polynomial function is all of C; i.e., p(z) is defined for every 
complex number z. 

For technical reasons of consistency, the identically function is called the zero polynomial. All 
of its coefficients are and its degree is defined to be — oo. 

A rational function is a function r that is given by an equation of the form r (z) = p (z) jq (z) , 
where q is a nonzero polynomial and p is a (possibly zero) polynomial. The domain of a rational 
function is the set S of all z £ C for which q (z) ^ 0, i.e., for which r (z) is defined. 

A partition of a closed geometric set S in R 2 is a finite collection {Si, S2, ■■■, S n } of 

nonoverlapping closed geometric sets for which U"_ 1 S , j = S; i.e., the union of the Si's is all of the 

geometric set S. 

The open subsets {5°} are called the elements of the partition. 

A step function on the closed geometric set S is a real- valued function h on S for which there 

exists a partition P = {Si} of S such that h (z) = aj for all z € S*; i.e., h is constant on each 

element of the partition P. 

A real number x that is not a rational number, i.e., is not an element of the subset Q of R, is 
called an irrational number. 
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A sequence of real or complex numbers is defined to be a function from the set N of natural 
numbers into the set_R or C. Instead of referring to such a function as an assignment n — ► / (n) , 
we ordinarily use the notation {a n },{a n }^°, or {01,02,03, ...}. Here, of course, a n denotes the 
number / (n) . 

A sequence {a n } of real numbers is called nondecreasing if a n < a n +i for all n, and it is called 
nonincreasing if a n > a„ +1 for all n. It is called strictly increasing if a n < a n+1 for all n, and 
strictly decreasing if a n > o„+i for all n. 

A sequence {a n } of real numbers is called eventually nondecreasing if there exists a natural 
number TV such that <z n < a n +i for all n > N, and it is called eventually nonincreasing if there 
exists a natural number N such that a n > a n+ i for all n> N. We make analogous definitions of 
"eventually strictly increasing" and "eventually strictly decreasing." 

A sequence {a n } of real or complex numbers is a Cauchy sequence if for every e > 0, there exists 
a natural number N such that if n > N and m > N then \a n — a m \ < e. 

A subset S of C is called Bounded if there exists a real number M such that \z\ < M for every z 
in S. 



An ordered field F is called complete if every nonempty subset S of F that has an upper bound 
has a least upper bound. 



By a (open) rectangle we will mean a set R = (a, b) x (c, d) in R 2 . That is, 
R = {(x, y) : a < x < b and c < y < d}. The analogous definition of a closed 
rectangle[a, b] x [c, d] should be clear: [a, b] x [c, d] = {(x, y) : a < x < 6, c < y < d}. 
By the area of a (open or closed) rectangle R = (a, b) x (c, d) or [a, 6] x [c, d] we mean the 
number A (R) = (b — a) (d — c) . . 

By a Dedekind cut, or simply a cut, we will mean a pair (A, B) of nonempty (not necessarily 
disjoint) subsets of the set Q of rational numbers for which the following two conditions hold. 

• A U B = Q. That is, every rational number is in one or the other of these two sets. 

• For every element a & A and every element b s 73,^1 < b. That is, every element of A is less 
than or equal to every element of B. 

By a smooth curve from a point z\ to a different point z-i in the plane, we mean a set C C C 

that is the range of a 1-1, smooth, function <j> : [a, b] — > C, where [a, b] is a bounded closed 

interval in R, where z\ = <j> (a) and zi = <f> (b) , and satisfying <fi' (t) 7^ for all t € (a, b) . 

More generally, if <f> : [a, b] — > i? 2 is 1-1 and piecewise smooth on [a, b] , and if {£0 < t\ < ... < t n } 

is a partition of [a, b] such that (j> (t) / for all t € (t»_i, £») , then the range C of (/> is called a 

piecewise smooth curve from zi = (j> (a) to Z2 = (f> (b) . 

In either of these cases, <j> is called a parameterization of the curve C. 



D 
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By a vector field on an open subset U of R 2 , we mean nothing more than a continuous function 

V {x, y) = {P (x, y) , Q (x, y)) from U into R 2 . The functions P and Q are called the components 

of the vector field V . 

We will also speak of smooth vector fields, by which we will mean vector fields V both of whose 
component functions P and Q have continuous partial derivatives 

tialP tialP tialQ tialQ 

, , and (6.46) 

tialx tialy tialx tialy 

on U. 
By the set R of real numbers we mean the (unique) complete ordered field. 

Define a power series function, denoted by exp, as follows: 

00 z n 
exp{z) = Y^—. (3.38) 

n=0 

We will call this function, with 20-20 hindsight, the exponential function. 

Define two power series functions cosh (hyperbolic cosine) and sinh (hyperbolic sine) by 

, . , exp(z) + exp(-z) , . , . . exp(z) — exp(z) . „ . 

cosh{z) = y ' — and sinh{z) = y ' ^-^-, (3.39) 

and two other power series functions cos (cosine) and sin (sine) by 

, n exp(iz) + exp(-iz) 
cos (z) = cosh (iz) = Fy ' — ^ '- (3.40) 

and 

. , s . . ,,. s exp {iz) -exp {-iz) 
svn{z) = —isinh{iz) = : . (»j-41) 

The five functions just defined are called the elementary transcendental functions, the sinh and 
cosh functions are called the basic hyperbolic functions, and the sine and cosine functions are 
called the basic trigonometric or circular functions. The connections between the hyperbolic 
functions and hyperbolic geometry, and the connection between the trigonometric functions and 
circles and triangles, will only emerge in the next chapter. From the very definitions, however, 
we can see a connection between the hyperbolic functions and the trigonometric functions. It's 
something like interchanging the roles of the real and imaginary axes. This is probably worth 
some more thought. 



F 



For a a positive real number and z an arbitrary complex number, define a z by 

a z = exp {zln {a)) . (4.72) 



G 
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Given [a, b] , I, and u as in the above, let S be the set of all pairs (x, y) G R 2 , for which a < x < b 
and I (x) < y < u (x) . Then S is called an open geometric set. If we replace the < signs with < 
signs, i.e., if S is the set of all (x, y) such that a < x < b, and I (x) < y < u (x) , then S is called 
a closed geometric set. In either case, we say that S is bounded on the left and right by the 
vertical line segments {(a, y) : 1(a) < y < u (a)} and {(b,y) : 1(b) < y < u (&)}, and it is bounded 
below by the graph of the function I and bounded above by the graph of the function u. We call 
the union of these four bounding curves the boundary of S, and denote it by Cg. 
If the bounding functions u and I of a geometric set S are smooth or piecewise smooth functions, 
we will call S a smooth or piecewise smooth geometric set. 



If (Ai, Bi) and (A%, B2) are Dedekind cuts, define the sum of (A\, B{) and (A2, -B2) to be the 
cut (^3,^3) described as follows: S3 is the set of all rational numbers 63 that can be written as 
b\ + 62 for some b\ e B\ and 62 € £>2 , and A3 is the set of all rational numbers r such that 
r < 63 for all 63 G _E> 3 . 

If / and g are two complex- valued functions with the same domain S, i.e., / : S — > C and 

g : S — » C, and if c is a complex number, we define f + g, fg, f/g (if g (x) is never 0), and c/ 

by the familiar formulas: 

(f + g)(x) = f(x) + g(x), (3.5) 

(fg)(x) = f(x)g(x), (3.6) 

(//9)(*W(*)/5(z), (3-7) 

and 

(c/)0r)=c/(a0. (3.8) 

If / and g are real-valued functions, we define functions max (/, g) and rain (/, g) by 

[m«i (/, 3)] (aj) = max (/ (x) , g (x)) (3.9) 

(the maximum of the numbers / (x) and g (x)), and 

[min (/, 3)] (x) = mm (/ (x) , g (x)) , (3.10) 

(the minimum of the two numbers / (x) and g (x)). 

If / is either a real-valued or a complex-valued function on a domain S, then we say that / is 
bounded if there exists a positive number M such that \f (x) | < M for all x G S. 

If / is a power series function, the number r of the preceding theorem is called the radius of 
convergence of the power series. The disk S of radius r around 0, denoted by B r (0) , is called 
the disk of convergence. 

If / is a real- valued function on a closed geometric set S in the plane, then / is integrable on S if 
it is the uniform limit of a sequence {h n } of step functions on S. 
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We define the integral of an integrable function / on S by 



/= / /0) dz = lim h n , (5.102) 

s J s J s 

where {h n } is a sequence of step functions on S that converges uniformly to /. 

If F is an ordered field, and x and y are elements of F, we say that x < y if y — igP. We say 

that x < y if either x < y or x = y. 

We say that x > y if y < x, and a; > y if y < x. 

If 5 is a subset of an ordered field F, then an element x e F is called an upper bound for S if 

x > y for every y G S. An element z is called a lower bound for S if z < y for every y & S. 

A subset S of an ordered field F is called bounded above if it has an upper bound; it is called 

bounded below if it has a lower bound; and it is called bounded if it has both an upper bound 

and a lower bound. 

An element M is called the least upper bound or supremum of a set S if it is an upper bound 

for S and if M < x for every other upper bound x of S. That is, M is less than or equal to any 

other upper bound of S. 

Similarly, an element m is called the greatest lower bound or infimum of S if it is a lower bound 

for S and if z < m for every other lower bound z of S. That is, m is greater than or equal to any 

other lower bound of S. 



If a; is a positive real number, then the symbol \fx will denote the unique positive number y for 
which y 2 = x. Of course, y/0 denotes the number 0. 

If x is an element of a field F, define inductively elements n ■ x = nx of F by 1 • x = x, and, if 
k ■ x is defined, set (k + 1) ■ x = x + k ■ x. The set S of all natural numbers n for which n ■ x is 
defined is therefore, by the axiom of mathematical induction, all of N. 

If x is the equivalence class of a cut (A, b) and y is the equivalence class of a cut (C, D) , then 
x + y is the equivalence class of the cut (A, B) + (C, D) . 

If z = x + yi is in C, we define the absolute value of z by 



\z\ = ■ s fx" 2 + y" 2 . (1.40) 

We define the distanced (z,w) between two complex numbers z and w by 
d(z, w) = \z — w\. 
If c € C and r > 0, we define the open disk of radius r around c, and denote it by B r (c) , by 

B r {c) = {zeC: \z-c\ <r}. (1.41) 

The closed disk of radius r around c is denoted by B r (c) and is defined by 

B r {c) = {z e C : \z-c\ < r}. (1.42) 
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We also define open and closed punctured disks B r (c) and B r (c) around c by 

B r (c) = {z : < \z - c\ < r} (1.43) 

and 

B r (c) = {z : < \z-c< r}. (1.44) 

These punctured disks are just like the regular disks, except that they do not contain the 
central point c. 

More generally, if S is any subset of C, we define the open neighborhood of radius r around S, 
denoted by N r (S) , to be the set of all z such that there exists a w G S for which \z — w\ < r. 
That is, N r (S) is the set of all complex numbers that are within a distance of r of the set S. We 
define the closed neighborhood of radius r around S, and denote it by N r (S) , to be the set of 
all z G C for which there exists & w e S such that \z — w\ < r. 

If z = x + yi, we say that the real number x is the real part of z and write x = ^t(z) . We say 

that the real number y is the imaginary part of z and write y = Q (z) . 

If z = x + yi is a complex number, define the complex conjugate's of z by z = x — yi. 



Let (Ai, Bi) and (A2, B2) be two Dedekind cuts, and suppose that one of these cuts is a positive 
cut. We define the product(Az, B3) of (Ai,B\) and (^2,-82) as follows: Set S3 equal to the set 
of all 63 that can be written as 6162 for some b\ G B\ and 62 G Bi- Then set A3 to be all the 
rational numbers r for which r < 63 for all 63 G B3. 

Let [a, b] be a closed bounded interval in R. A real-valued function h : [a,b] — » R is called a step 
function if there exists a partition P = {xq < x\ < ... < x n } of [a, b) such that for each 1 < i < n 
there exists a number a, such that h (x) = a{ for all x G (xt-i, Xi) . 

Let [a, b] be a closed bounded interval of real numbers. A function / : [a, b] — > R is called 
integrable on [o, b] if it is the uniform limit of a sequence {h n } of step functions. 
Let I ([a, b}) denote the set of all functions that are integrable on [o, b] . If / G / ([a, b}) , define 
the integral of /, denoted J f, by 



f = lim hn, (5.19) 

where {h n } is some (any) sequence of step functions that converges uniformly to / on [a, b] . 
As in the case of step functions, we use the following notations: 

/= f f= f f(t)dt. (5.20) 

Let [a, b] be a closed bounded interval of real numbers. By a partition of [o, b] we mean a finite 

set P = {xq < x\ < ... < x n } of n + 1 points, where xq = a and x n = b. 

The n intervals {[;ej_i,2;j]} are called the closed subintervals of the partition P, and the n 

intervals {(xi-i,Xi)} are called the open subintervals or elements of P. 

We write || P \\ for the maximum of the numbers (lengths of the subintervals) {xi — Xi-i}, and 

call || P || the mesh size of the partition P. 
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If a partition P = {xi} is contained in another partition Q = {yj}, i.e., each xi equals some yj, 
then we say that Q is finer than P. 

Let / be a function on an interval [a, b] , and let P = {xq < ... < x n } be a partition of [a, b] . 
Physicists often consider sums of the form 

n 

i=\ 

where yi is a point in the subinterval (xi-i,Xi) . These sums (called Riemann sums) are 
approximations of physical quantities, and the limit of these sums, as the mesh of the partition 
becomes smaller and smaller, should represent a precise value of the physical quantity. What 
precisely is meant by the " limit" of such sums is already a subtle question, but even having 
decided on what that definition should be, it is as important and difficult to determine whether 
or not such a limit exists for many (or even any) functions /. We approach this question from a 
slightly different point of view, but we will revisit Riemann sums in the end. 

Let [a, b] be a closed bounded interval of real numbers. By a partition of [a, b] we mean a finite 

set P = {xq < x\ < ... < x n } of n + 1 points, where xq = a and x n = b. 

The n intervals {[£1-1,2;,]}, for 1 < i < n, are called the closed subintervals of the partition P, 

and the n intervals {(xi-i,Xi)} are called the open subintervals of P. 

We write || P || for the maximum of the numbers (lengths of the subintervals) {xi — Xi_i}, and 

call the number || P || the mesh size of the partition P. 

A function h : [a, b] — > C is called a step function if there exists a partition 

P = { x o < x\ < ... < x n } of [a, b] and n numbers {01,02, ...,a n } such that h{x) = a, if 

Xi-i < x < Xi. That is, h is a step function if it is a constant function on each of the (open) 

subintervals (xi-i, xi) determined by a partition P. Note that the values of a step function at 

the points {xi} of the partition are not restricted in any way. 

A function I : [a, b] — » R is called a polygonal function, or a piecewise linear function, if there 

exists a partition P = {x < X\ < ... < x n } of [a, b] and n + 1 numbers {y , J/i, ••■, y n } such that 

for each x € [:Ej_i,a:,] ,1 (x) is given by the linear equation 

I (x) = yi-i + rrii (x - Xi-i) , (3-12) 

where rrii = (y; — Vh) / (xi — #i-i) • That is, I is a polygonal function if it is a linear function on 
each of the closed subintervals [xi-i, Xi] determined by a partition P. Note that the values of a 
piecewise linear function at the points {xi} of the partition P are the same, whether we think of 
Xi in the interval [:Ej_i, Xj\ or [xi, Xi + i] . (Check the two formulas for / [xi) .) 
The graph of a piecewise linear function is the polygonal line joining the n+ 1 points {(xi, Vi)}- 
There is a natural generalization of the notion of a step function that works for any domain S, 
e.g., a rectangle in the plane C. Thus, if S is a set, we define a partition of 5 to be a finite 
collection {.Ei, E2, ..., E n } of subsets of S for which 

• VT l=1 Ei = S, and 

• E t n Ej = if i =£ j. 

Then, a step function on S would be a function h that is constant on each subset Ei. We will 

encounter an even more elaborate generalized notion of a step function in Chapter V, but for 

now we will restrict our attention to step functions defined on intervals [a, b] . 

The set of polynomials and the set of step functions are both closed under addition and 

multiplication, and the set of rational functions is closed under addition, multiplication, and 

division. 
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Let [a, b] be a fixed bounded and closed interval. A complex- valued function / = u + iv is called 
integrable if its real and imaginary parts u and v are integrable. In this case, we define 

f'b f'b f'b f'b 

I f= (u + iv)= u + i v. (5.32) 

J a J a J a J a 

Let [a, b] be a fixed closed bounded interval in R. We define the integral of a step function h on 
[a, b] , and denote it by J h, as follows: If P = {xq < x\ < ... < x n } is a partition of [a, b] , for 
which h (x) = a>i for all x G (ajj-i, x{) , then 

/n 
h = S P (h) = y^ Qj [xj - a;j_i) . (5.5) 

i=l 

Let a and 6 be real numbers for which a < b. By the open interval (a,b) we mean the set of all 
real numbers x for which a < x < b, and by the closed interval [a,b] we mean the set of all real 
numbers x for which a < x < b. 



Let a be a natural number. We define inductively natural numbers a" as follows: a 1 = a, and, 
whenever a k is defined, then a k+1 is defined to be a x a*. 



Let C be a piecewise smooth curve from z\ to z^ in the plane C, parameterized by a 
(complex- valued) function <f> : [a, b] — > C. If / is a continuous, complex-valued function on C, 
The contour integral of f from z\ to zi along C will be denoted by J c f (£) dC, or more precisely 
by J ' c z z \f (C) d(, and is defindd by 

/ ?J(()d(= I f(4>(t))4>'(t)dt. (6.41) 

J C Jo, 

Let C be a piecewise smooth curve in the plane. The length or arc lengthL = L(C) of C is 
defined by the formula 

L{C) = L< t > = supL P , (6.24) 

p 

where <f> is any parameterization of C. 

If z and w are two points on a piecewise smooth curve C, we will denote by L (z, w) the arc 

length of the portion of the curve between z and w. 

Let C be a piecewise smooth curve of finite length L joining distinct points, and let 

7 : [0, L] — > C be a parameterization of C by arc length. By a partition of C we mean a set 

{z , zi, ..., z n } of points on C such that Zj = 7 (tj) for all j, where the points {t < t\ < ... < t n } 

form a partition of the interval [0, L] . The portions of the curve between the points Zj-i and Zj, 

i.e., the set ^(tj-\,tj) , are called the elements of the partition. 

A step fucntion on C is a real- valued function h on C for which there exists a partition 

{zo, z\, ..., z n } of C such that h (z) is a constant dj on the portion of the curve between Zj-\ and 

Zi. 
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Let C be a piecewise smooth curve of finite length L. A function / with domain C is called 
integrable with respect to arc length on C if it is the uniform limit of step functions on C. 
The integral with respect to arc length of an integrable function / on C is again denoted by 
J c f (s) ds, and is defined by 



/ (s) ds = km I h n (s) ds, (6.34) 

c J c 

where {h n } is a sequence of step functions that converges uniformly to / on C. 

Let C, the range of <fi : [a, b] — » C, be a piecewise smooth curve, and let z = (x,y) = <f> (c) be a 
point on the curve. We say that the curve C has a tangential direction at z, relative to the 
parameterization cf>, if the following limit exists: 

<j>{t)-z <f>(t)-^(c) 
km- — — = km- — — — - -. (6.7 

If this limit exists, it is a vector of length 1 in R 2 , and this unit vector is called the unit tangent 
(relative to the parameterization <f>) to C at z. 

The curve C has a unit tangent at the point z if there exists a parameterization <fi for which the 
unit tangent at z relative to <j> exists. 

Let F be a field, and let a; be a nonzero element of F. 

For each natural number n, we define inductively an element x n in F as follows: x 1 = x, and, if 

x k is defined, set x k+1 = x x x k . Of course, x n is just the product of nx's. 

Define x° to be 1. 

For each natural number n, define x~ n to be the multiplicative inverse (x n )~ of the element x n . 

Finally, we define m to be for every positive integer to, and we leave _n and 0° undefined. 

Let / be a real or complex- valued function on the open interval (a, b) where a is possibly — oo 
and b is possibly +oo. We say that / is improperly-integrable on (a, b) if it is integrable on each 
closed and bounded subinterval [a , b'] C (a, b) , and for each point c e (a, 6) we have that the 

two limits limb — > b — J f and km a -^ a+0 J , f exist. 

More generally, We say that a real or complex- valued function /, not necessarily defined on all of 

the open interval (a, b) , is improperly-integrable on (a, b) if there exists a partition {xi} of [a, b] 

such that / is defined and improperly-integrable on each open interval (xi~i,Xi) . 

We denote the set of all functions / that are improperly-integrable on an open interval (a, b) by 

Ii((a,b)). 

Let / be continuous on a punctured disk B' r (c) , and analytic at each point of B r (c) . The point 
c is called an essential singularity of / if it is neither a removable singularity nor a poll of any 
finite order. Singularities that are either poles or essential singularities are called nonremovable 
singularities. 

Let / be defined and improperly-integrable on an open interval (o, b) . We define the integral of / 
over the interval (a, b) , and denote it by J /, by 

b t'C t'b 

f= km f+ km / /. (5.81) 

o'^a+ol' b'^b-0j c 
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In general, if / is improperly- integrable over an open interval, i.e., / is defined and 
improperly-integrable over each subinterval of (o, b) determined by a partition {xi}, then we 
define the integral of / over the interval (a, b) by 

n I'Xi 

/ = £/ /• ( 5 - 82 ) 



Let / be in C™ (2? r (c)) for c a fixed complex number, r > 0, and n a positive integer. Define the 
Taylor polynomial of degree n for / at c to be the polynomial T n = TJ\ c -, given by the formula: 

n 

fe.c)) (*) = £«*(*-<=)'', (4-115) 

where <2j = /(■?) (c) /j!. 

Let Fi and F 2 be two ordered fields, and write P\ and P 2 for the set of positive elements in F\ 
and F 2 respectively. A 1-1 correspondence J between F\ and i<2 is called an isomorphism if 

• J (x + y) = J (x) + J (y) for all x, y G Fi- 

• J (xy) = J (x) J (y) for all x,y & F\. 

• a; € Pi if and only if J (x) € P2 • 

Let / : S — » C be a function, where S C C, and let c be a limit point of S 1 that is not necessarily 
an element of S. We say that /has a iimit L as z approaches c, and we write 

limf(z) = L, (4.1) 

z — >c 

if for every e > there exists a <$ > such that if z G S and < |z — c\ < 5, then \f (z) — L\ < e. 
If the domain S is unbounded, we say that f has a limit L as z approachesoo, and we write 

L= limf(z), (4.2) 

z— >oo 

if for every £ > there exists a positive number B such that if z 6 5 and |z| > _B, then 

|/(z)-L|<e. 

Analogously, if 5 C R, we say lirrix^oof (x) = L if for every e > there exists a real number B 

such that if x € 5 and x > B, then |/ (x) — L| < e. And we say that lim x ->-oof i x ) = L if for 

every e > there exists a real number 5 such that if x s £ and x < B, then |/ (x) — L| < e. 

Finally, for / : (a, 6) — > C a function of a real variable, and for c € [a, 6] , we define the one-sided 

(left and right) limits of / at c. We say that / has a left hand limit of L at c, and we write 

L = Um x ^ c _ n f (x) , if for every e > there exists a <5 > such that if x € (a, 6) and 

< c — x < 5 then |/ (x) — L\ < e. We say that / has a right hand limit of L at c, and write 

L = lim x ->c+of (x) , if for every e > there exists a <5 > such that if x € S and < x — c < (5 

then |/(x) - L\ < e. 

Let / : 5 — > i? be a function whose domain is a subset S of R 2 , and let c = (a, 6) be a point in 
the interior 5° of S. We say that / is different iable, as a function of two real variables, at the 
point (a, b) if there exists a pair of real numbers L\ and Li and a function 9 such that 

/ (a + hi, b + ha) - f (a, b) = L x h x + L 2 h 2 + 6 (hi,h 2 ) (4.140) 
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and 

8 (hi, ho) 
Urn , /■ \ 7. = 0. 4.141 

One should compare this definition with part (3) of 2 . 

Each partial derivative of a function / is again a real- valued function of two real variables, and 
so it can have partial derivatives of its own. We use simplifying notation like f xyxx and f y yy Xy y... 
to indicate "higher order" mixed partial derivatives. For instance, f xxyx denotes the fourth 
partial derivative of /, first with respect to x, second with respect to x again, third with respect 
to y, and finally fourth with respect to x. These higher order partial derivatives are called mixed 
partial derivatives. 

Let / : S — > R be a real- valued function of a real variable, and let c be an element of the interior 

of S. Then / is said to attain a local maximum at c if there exists a 6 > such that 

(c - 6, c + 5) C S and / (c) > / (x) for all x € (c - S, c + 8) . 

The function / is said to attain a local minimum at c if there exists an interval (c — 5, c + 5) C S 

such that / (c) < / (x) for all a; € (c — 6, c + 8) . 

Let / : S — > R be defined on a set S C C = R 2 , and let c = (a, b) = + + hi be a point in the 
interior of S. We define the partial derivative of f with respect to x at the point c = (a, b) by the 
formula 

— — (o, b) = hm ; , (4.25) 

and the partial derivative of f with respect to y at c = (a, 6) by the formula 

^(a,6) = fa /(ffi ' i+/t) - /(fl - i) , (4.26) 

tialy h->0 ft 

whenever these limits exist. (In both these limits, the variable h is a real variable.) ( 

It is clear that the partial derivatives of a function arise when we fix either the real part of the 

variable or the imaginary part of the variable to be a constant, and then consider the resulting 

function of the other (real) variable. We will see in 3 that there is a definite difference between a 

function's being differentiable at a point c= (a + bi) in the complex plane C versus its having 

partial derivatives at the point (a, b) in R 2 . 

Let / : S — > T and g : T — » U be functions. We define a function g o /, with domain S and 
codomain U, by (g o /) (x) =g(f(x)). 

If / : S — > T,g : T — > 5, and go/ (x) = a; for all x e S, then 5 is called a left inverse of /. If 
/ ° 9 (y) = V f° r a ll ?/ G T, then g is called a right inverse for /. If g is both a left inverse and a 
right inverse, then g is called an inverse for /,/ is called invertible, and we denote g by / _1 . 

Let /i be a step function on a closed geometric set S. Define the integral of h over the geometric 
set S by the formula 

f h= J H(z) dz = ^Ta l A(S i ) 1 (5.99) 

J s J s i=1 

2 http://cnx.org/content/m36206/latest/ 
3 http://cnx.org/content/m36186/latest/ 



230 GLOSSARY 

where S\,...,S n is a partition of S for which h is the constant Oj on the interior Sf of the set Si. 

Let ftbea step function on a piecewise smooth curve C of finite length L. The integral, with 
respect to arc length of h over C is denoted by J „h (s) ds, and is defined by 

/n 
h (s) ds = y^ a jL (zj-i, Zj) , (6.32) 

C 3 = 1 

where {zq, z\, ..., z n } is a partition of C for which h (z) is the constant a,j on the portion of C 
between Zj-i and Zj. 

Let h be a step function on the closed interval [a, b] . Suppose P = {xo < x\ < ... < x n } is a 
partition of [a, b] such that h (x) = a,i on the interval (xi-i, Xi) . Define the weighted average of 
hrelative toP to be the number Sp (h) defined by 

n 

Sp{h) = ^2ai(xi- Xj_i). (5.3) 

Let i denote an object whose square i 2 = — 1. Let C be the set of all objects that can be 
represented in the form z = x + yi, where both x and y are real numbers. 
Define two operations + and x on C as follows: 

(x + yi) + (x + y i) = x + x + (y + y) i, (1-34) 

and 

(x + iy) (x + iy ) = xx + xiy + iyx + iyiy = xx — yy + (xy + yx) i. (1.35) 

Let n be a natural number. As earlier in this chapter, we define n! as follows: 

n\ = n x (n - 1) x (n - 2) x ... x 2 x 1. (1.19) 

For later notational convenience, we also define 0! to be 1. 
If k is any integer for which < k < n, we define the binomial coefficient (™) by 

/n\ n\ n x (n — 1) x (n — 2) x ... x (n — k+ 1) 

\k) = k\{n-k)\ = k\ ■ (1 ' 2 °^ 

Let S and T be sets of complex numbers, and let / : S — > T. Then / is said to be continuous at a 

pointc of S if for every positive e, there exists a positive 6 such that if x € S satisfies \x — c\ < S, 

then |/ (x) — / (c) | < e. The function / is called continuous on S if it is continuous at every 

point c of S. 

If the domain S of / consists of real numbers, then the function / is called right continuous at c 

if for every e > there exists a 5 > such that \f (x) — / (c) | < e whenever x € S and 

< x — c < S, and is called left continuous at c if for every e > there exists a c> > such that 

|/ (x) — f (c) | < e whenever x G 5 and > x — c > — <5. 
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Let S and T be sets. A function from S into T (notation / : S — > T) is a rule that assigns to 

each element a; in S a unique element denoted by / (x) in T. 

It is useful to think of a function as a mechanism or black box. We use the elements of S as 

inputs to the function, and the outputs are elements of the set T. 

If / : S — > T is a function, then S is called the domain of /, and the set T is called the codomain 

of /. The range or image of / is the set of all elements y in the codomain T for which there 

exists an x in the domain S such that y = f (x) . We denote the range by / (S) . The codomain 

is the set of all potential outputs, while the range is the set of actual outputs. 

Suppose / is a function from a set S into a set T. If A C S, we write / (A) for the subset of T 

containing all the elements ieT for which there exists an s E A such that t = f (s) . We call 

/ (A) the image of A under /. Similarly, if B C T, we write / _1 (_B) for the subset of S 

containing all the elements s € S such that / (s) G £>, and we call the set f~ x (B) the inverse 

image or preimage of 5. The symbol / _1 (B) is a little confusing, since it could be 

misinterpreted as the image of the set B under a function called f~ l . We will discuss inverse 

functions later on, but this notation is not meant to imply that the function / has an inverse. 

If / : S — > T, then the graph of / is the subset G of the Cartesian product S xT consisting of all 

the pairs of the form (a;, / (x)) . 

If / : S — > R is a function, then we call / a real-valued function, and if / : S — > C, then we call / 

a complex-valued function. If / : S — * C is a complex-valued function, then for each x G S the 

complex number / (x) can be written as u (x) + iv (x) , where u (x) and v (x) are the real and 

imaginary parts of the complex number / (x) . The two real- valued functions u : S — > R and 

v : S — » i? are called respectively the reai and imaginary parts of the complex- valued function /. 

If / : S — » T and SCR, then / is called a function of a reai variable, and if S C C, then / is 

called a function of a complex variable. 

If the range of / equals the codomain, then / is called onto. 

The function / : S — > T is called one-to-one if / (xi) = / (X2) implies that x\ = X2- 

Let 5 be a geometric set (either open or closed), bounded on the left by x = a, on the right by 
x = 6, below by the graph of I, and above by the graph of u. Define the areaA (S) of S by 

n 

A (5) = supAp = sup Y^ (xi - Xj_i) (di - c,) , (5.67) 

P P={a;o<a;i<...<a;„} i=1 

where the supremum is taken over all partitions P of [a, b] , and where the numbers Cj and dj 
are as defined above. 

Let 5 be a subset of C (respectively R. By an open cover of S we mean a sequence {U n } of open 
subsets of C (respectively R) such that S C Ut/„; i.e., for every x € S'there exists an n such that 
x G [/„. 

A subset S of C (respectively i2) is called compact, or is said to satisfy the Heine-Borel 
property, if every open cover of S has a finite subcover. That is, if {U n } is an open cover of S, 
then there exists an integer N such that S C U^^L^. In other words, only a finite number of 
the open sets are necessary to cover S. 

Let S be a subset of C, let / : S — » C be a complex-valued function, and let c be a point of S. 
Then / is said to be expandable in a Taylor series around c with radius of convergence r if there 
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exists an r > such that B r (c) C S, and / (z) is given by the formula 

DC 

f{z) = Y j a n {z-c) n (3.51) 

for all z & B r (c) . 

Let S be a subset of R, let / : S — » -R be a real-valued function on S*, and let c be a point of S*. 
Then / is said to be expandable in a Taylor series around c with radius of convergence r if there 
exists an r > such that the interval (c — r,c + r) C S 1 , and / (x) is given by the formula 

DC 

f{x) = Y j a n {x-c) n (3.52) 

/i=0 

for all x € (c — r, c + r) . 

Suppose S is an open subset of C. A function / : S — > C is called analytic on S if it is 
expandable in a Taylor series around every point c of S. 

Suppose S is an open subset of R. A function / : S — > C is called reai analytic on S if it is 
expandable in a Taylor series around every point c of S. 

Let 5 be a subset of C. A complex number x is called a limit point of £ if there exists a sequence 
{x n } of elements of S such that x = limx n . 
A set S C C is called closed if every limit point of S 1 belongs to 5. 

Let 5 be a subset of C. A point x € 5 is called an interior point of 5 if there exists an e > such 

that the open disk B e (x) of radius e around x is entirely contained in S. The set of all interior 

points of S is denoted by S° and we call 5° the interior of S. 

A subset S of C is called an open subset of C if every point of S is an interior point of S; i.e., if 

S = S°. 

Analogously, let S be a subset of R. A point x € S is called an interior point of S if there exists 

an e > such that the open interval (x — £,x + e) is entirely contained in S. Again, we denote 

the set of all interior points of S by S° and call S° the interior of S. 

A subset S of R is called an open subset of R if every point of S is an interior point of S; i.e., if 

S = S°. 

Let 5 be a subset of R (or C), and Let / : S — » C be a function of a real (or complex) variable. 
We say that / is continuously different iable on S° if / is differentiable at each point x of S° and 
the function /' is continuous on S° . We say that / s C 1 (S) if / is continuous on S and 
continuously differentiable on 5°. We say that / is 2-times continuously differentiable on S° if 
the first derivative / is itself continuously differentiable on S°. And, inductively, we say that / 
is k-times continuously differentiable on S° if the k — 1st derivative of / is itself continuously 
differentiable on 5°. We write f^ for the kth derivative of /, and we write / s C k (S) if / is 
continuous on S and is k times continuously differentiable on 5°. Of course, if / s C k (S) , then 
all the derivatives f^\ for j < k, exist nd are continuous on 5°. (Why?) 

For completeness, we define /(°) to be / itself, and we say that / s C°° (S) if / is continuous on 
S and has infinitely many continuous derivatives on S°; i.e., all of its derivatives exist and are 
continuous on S°. 

As in 4 , we say that / is real-analytic (or complex-analytic) on S if it is expandable in a Taylor 
series around each point c E S° 



4 http://cnx.org/content/m36192/latest/ 
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Let S be a subset of R, let / : S — > C be a complex- valued function (of a real variable), and let c 
be an element of the interior of S. We say that / is differentiable at c if 

km (4.8) 

h^o h 

exists. (Here, the number h is a real number.) 

Analogously, let S be a subset of C, let / : S — > C be a complex- valued function (of a complex 
variable), and let c be an element of the interior of S. We say that / is differentiable at c if 

ImM^ (4.9) 

h^O h 

exists. (Here, the number h is a complex number.) 

If / : S — ► C is a function either of a real variable or a complex variable, and if S' denotes the 
subset of S consisting of the points c where / is differentiable, we define a function f':S'—*C 
by 

f'(x) = lim f{x+h) - f{x) . (4.10) 

ft— >o /l 

The function /' is called the derivative of /. 

A continuous function / : [a, b] — » C that is differentiable at each point a; € (a, 6) , and whose the 
derivative /' is continuous on (a, 6) , is called a smooth function on [a, 6] . If there exists a 
partition {a = xo < x\ < ... < x n = b} of [a, b] such that / is smooth on each subinterval 
[x»_i, Xi] , then / is called piecewise smooth on [a, b] . 

Higher order derivatives are defined inductively. That is, /' is the derivative of /', and so on. 
We use the symbol /(") for the nth derivative of /. 



Let S be a subset of R 2 . By the symmetric image of S we mean the set S of all points 
(x, y) G R 2 for which the point (y, x) e S. 

Let x and y be elements of F. 

If either x or y is positive, define the product x x y to be the equivalence class of the cut 

(A, B) (C, D) , where x is the equivalence class of (A, B) and y is the equivalence class of (C, D) . 

If either x or y is 0, define x x y to be 0. 

If both x and y are negative, i.e., both —x and — y are positive, define x x y = (—x) x (— y) . 

Let {a n } be a sequence of real numbers and let L be a real number. The sequence {a„} is said to 
converge to L, or that L is the limit of {a„}, if the following condition is satisfied. For every 
positive number e, there exists a natural number N such that if n > N, then \a n — L\ < e. 
In symbols, we say L = lima n or 

L = lim a n . (2-1) 

n — >oo 

We also may write a n <—> L. 

If a sequence {a n } of real or complex numbers converges to a number L, we say that the 
sequence {a n } is convergent. 
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We say that a sequence {a n } of real numbers diverges to +00 if for every positive number M, 

there exists a natural number N such that if n > N, then a n > M. Note that we do not say that 

such a sequence is convergent. 

Similarly, we say that a sequence {a n } of real numbers diverges to — 00 if for every real number 

M, there exists a natural number N such that if n > N, then a n < M. 

The definition of convergence for a sequence {z n } of complex numbers is exactly the same as for 

a sequence of real numbers. Thus, let {z n } be a sequence of complex numbers and let L be a 

complex number. The sequence {z n } is said to converge to L, or that L is the limit of {z n }, if 

the following condition is satisfied. For every positive number s, there exists a natural number 

N such that if n > N, then \z n — L\ < e. 

Let {a n } be a sequence of real numbers and let S denote its cluster set. 

If S is nonempty and bounded above, we define lim supa n to be the supremum supS of S. 

If S is nonempty and bounded below, we define lim infa n to be the infimum infS of S. 

If the sequence {a n } of real numbers is not bounded above, we define lim supa n to be 00, and if 

{a n } is not bounded below, we define lim infa n to be —00. 

If {^n} diverges to 00, then we define lim supa n and lim infa n both to be 00. And, if {a n } 

diverges to —00, we define lim supa n and lim infa n both to be —00. 

We call lim supa n the limit superior of the sequence {a n }, and lim infa n the limit inferior of 

{a n }. 

Let {a n } be a sequence of real or complex numbers. A number x is called a cluster point of the 
sequence {a n } if there exists a subsequence {bk} of {a n } such that x = limb},. The set of all 
cluster points of a sequence {a n } is called the cluster set of the sequence. 

Let {a n } be a sequence of real or complex numbers. A subsequence of {a n } is a sequence {bk} 
that is determined by the sequence {a n } together with a strictly increasing sequence {rik} of 
natural numbers. The sequence {bk} is defined by bk = a nk . That is, the fcth term of the 
sequence {bk} is the n^th term of the original sequence {a n }. 

Let {a„}§° be a sequence of real or complex numbers. By the infinite series^ a n we mean the 
sequence {Sn} defined by 

N 

S N = Y j an- (2.31) 

n=0 

The sequence {Sn} is called the sequence of partial sums of the infinite series Yl a n, and the 
infinite series is said to be summable to a number S, or to be convergent, if the sequence {Sn} of 
partial sums converges to .S.The sum of an infinite series is the limit of its partial sums. 

An infinite series Yl a n is called absolutely summable or absolutely convergent if the infinite 

series J2 \ a n\ is convergent. 

If Yl a n is not convergent, it is called divergent. If it is convergent but not absolutely 

convergent, it is called conditionally convergent. 

A few simple formulas relating the a n 's and the Sn's are useful: 

S N = a + Oi + a 2 + ... + a N , (2.32) 
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Sn+i = Sn + ajv+i, (2.33) 

and 

M 

Sm - S K = ^2, a n = a K+i + a>K+2 + ■■■ + A M , (2.34) 

n=K+l 

for M > K. 

Let {a n }§° be a sequence of real or complex numbers. By the power series 

function/ (z) = Yl^Lo a « z ™ we mean the function / : S — ► C where the domain S is the set of all 

z € C for which the infinite series Yl a>nZ n converges, and where / is the rule that assigns to 

such az£S the sum of the series. 

The numbers {a n } defining a power series function are called the coefficients of the function. 

We associate to a power series function / (z) = X^^lo a « z " its sequence {Sn} of partial sums. 

We write 

N 

Sn (z) = Y, a ^ n - ( 3 - 31 ) 

Notice that polynomial functions are very special cases of power series functions. They are the 
power series functions for which the coefficients {a n } are all beyond some point. Note also 
that each partial sum Sn for any power series function is itself a polynomial function of degree 
less than or equal to N. Moreover, if / is a power series function, then for each z in its domain 
we have f (z) = UtunSn (z) . Evidently, every power series function is a "limit" of a sequence of 
polynomials. 

Obviously, the domain S = Sf of a power series function / depends on the coefficients {a n } 
determining the function. Our first goal is to describe this domain. 

Let a be a complex number, and let k be a nonnegative integer. We define the general binomial 
coefficient^) by 

a\ a (a — I) ... (a — k + I) 

If a is itself a positive integer and k < a, then (?) agrees with the earlier definition of the 
binomial coefficient, and ( ?) = when k > a. However, if a is not an integer, but just an 
arbitrary complex number, then every (?) ^ 0. 

Let <j) : [a, b] — > C be a parameterization of a piecewise smooth curve C C C. By the lengthL^ of 
C, relative to the parameterization <f>, we mean the number L^ = sup P Lp, where the supremum 



is taken over all partitions P of [a, b] 



Suppose S is a subset of R 2 , and that / is a continuous real- valued function on S. If both partial 
derivatives of / exist at each point of the interior S° of S, and both are continuous on S°, then 
/ is said to belong to C 1 (S) . If all fcth order mixed partial derivatives exist at each point of S°, 
and all of them are continuous on 5°, then / is said to belong to C k (S) . Finally, if all mixed 
partial derivatives, of arbitrary orders, exist and are continuous on 5°, then / is said to belong 
to C°° (S) . 



236 GLOSSARY 



The absolute value of a real number x is denoted by \x\ and is defined as follows: 

• « |0| = 0. 

• (ii) If x > then \x\ = x. 



w 



• 



(iii) If x < (-x > 0) then |cc| = -x. 



o 



The overlap of two geometric sets Si and S2 is defined to be the interior (Si PI S2) of their 
intersection. Si and 6*2 are called nonoverlapping if this overlap (Si PI 52) is the empty set. 

Two Dedekind cuts (Ai, 61) and (A2, B2) are called equivalent if 01 < 62 for all ai G Ai and all 
i>2 G -B2, and 122 < 61 for all 02 € ^2 and all 61 e Bi. In such a case, we write 
(Ai, Bi) = (A 2 , B 2 ) . 



We call the inverse exp 1 of the restriction of the exponential function to R the (natural) 
logarithm function, and we denote this function by In. 

We say that the sequence {f n }converges or converges pointwise to a function / : S — ► C if for 
every x E S and every £ > there exists a natural number AT, depending on x and e, such that 
for every n > N,\f n (x) — f (x) | < e. That is, equivalently, {/„} converges pointwise to / if for 
every x G S the sequence {/„ (x)} of numbers converges to the number / (x) . 
We say that the sequence {/„} converges uniformly to a function / if for every e > 0, there exists 
an N, depending only on e, such that for every n > N and every x G S,\f n (x) — f (x) \ < e. 
If {un] is a sequence of functions defined on S, we say that the infinite series ^u n converges 
uniformly if the sequence {Sn = 5^ n =o u ™} °f partial sums converges uniformly. 
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